Magnetic Resonance 
Spectroscopy and Imaging: 

NMR, MRI and ESR 



PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. 

PDF generated at: Sat, 14 Aug 2010 04:31:38 UTC 



Contents 



Articles 

Nuclear Magnetic Resonance 



Version 3.0 2 

Relaxation 2 

Chemical Shift 3 

Knight shift 7 

Robinson oscillator 8 

Fourier transform 8 

Discrete Fourier transform 30 

Fast Fourier transform 43 

Fourier transform spectroscopy 5 1 

Nuclear magnetic resonance 55 

NMR Spectroscopy 70 

2D-NMR 76 

User:Bci2/2D-FT NMRI and Spectroscopy 78 

Solid-state nuclear magnetic resonance 84 

Magnetic resonance microscopy 90 

Imaging 92 

Medical imaging 92 

MRI 99 

ESR Spectroscopy and Microspectroscopy 118 

ESR 118 

References 

Article Sources and Contributors 127 

Image Sources, Licenses and Contributors 129 

Article Licenses 

License 130 



1 



Nuclear Magnetic Resonance 



2 



Version 3.0 



Relaxation 

Relaxation stands quite generally for a release of tension, a return to equilibrium. 
In the sciences, the term is used in the following ways: 

• Relaxation (physics), and more in particular: 

• Relaxation (NMR), processes by which nuclear magnetization returns to the equilibrium distribution 

• Dielectric relaxation, the delay in the dielectric constant of a material 

• Structural relaxation, responsible for the glass transition 

• In mathematics: 

• Relaxation technique (mathematics), a technique for transforming hard constraints into easier ones 

• Relaxation method, for numerically solving elliptic partial differential equations 

• In c omputer s cience : 

• Relaxation, the act of substituting alternative program code during linking 
In Physiology, Hypnotism, Meditation, Recreation: 

• Relaxation technique, an activity that helps a person to relax 

• Relaxed in Flow (psychology), a state of arousal, flow, over-learned self-control and relaxation 

• Relaxation (psychology), the emotional state of low tension 

See also: 

• Tension (music) 
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Chemical Shift 



In nuclear magnetic resonance (NMR), the chemical shift describes the dependence of nuclear magnetic energy 
levels on the electronic environment in a molecule J 1 ^ ^ ^ Chemical shifts are relevant in NMR spectroscopy 
techniques such as proton NMR and carbon- 13 NMR. 

An atomic nucleus can have a magnetic moment (nuclear spin), which gives rise to different energy levels and 
resonance frequencies in a magnetic field. The total magnetic field experienced by a nucleus includes local magnetic 
fields induced by currents of electrons in the molecular orbitals (note that electrons have a magnetic moment 
themselves). The electron distribution of the same type of nucleus (e.g. 1 H, 13 C, 15 N) usually varies according to the 
local geometry (binding partners, bond lengths, angles between bonds, ...), and with it the local magnetic field at 
each nucleus. This is reflected in the spin energy levels (and resonance frequencies). The variations of nuclear 
magnetic resonance frequencies of the same kind of nucleus, due to variations in the electron distribution, is called 
the chemical shift. The size of the chemical shift is given with respect to a reference frequency or reference sample 
(see also chemical shift referencing), usually a molecule with a barely distorted electron distribution. 

The chemical shift is of great importance for NMR spectroscopy, a technique to explore molecular properties by 
looking at nuclear magnetic resonance phenomena. 



Operating frequency 

The operating frequency c^oof a magnet is calculated from the Larmor equation 

= 7 * 

where BqIS the actual strength of the magnet in units like teslas or gauss, and 7 is the gyromagnetic ratio of the 
nucleus being tested which is in turn calculated from its magnetic moment [A and spin number J with the nuclear 
magneton and the Planck constant h: 

1 = ~kT 

Thus, the proton operating frequency for a 1 T magnet is calculated as: 

2.79 x 5.05 x 10" 27 J/T Hm 

= 7^o = CM in 34T tt^v x IT = 42.5MHz 

6.62 x 10- 34 Js x (1/2) 

Chemical shift referencing 

Chemical shift 6 is usually expressed in parts per million (ppm) by frequency, because it is calculated from: 
difference in precession frequency between two nuclei 

0 = ; 

operating frequency of the magnet 
Since the numerator is usually in hertz, and the denominator in megahertz, delta is expressed in ppm. 

The detected frequencies (in Hz) for 1 H, 13 C, and 29 Si nuclei are usually referenced against TMS (tetramethylsilane) 
or DSS, which is assigned the chemical shift of zero. Other standard materials are used for setting the chemical shift 
for other nuclei. 

Thus, an NMR signal at 300 Hz from TMS at an applied frequency of 300MHz has a chemical shift of: 

300 Hz . __ 6 



= 1 x 10 _D = 1 ppm 



300 x 10 6 Hz 

Although the frequency depends on the applied field the chemical shift is independent of it. On the other hand the 
resolution of NMR will increase with applied magnetic field resulting in ever increasing chemical shift changes. 
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The induced magnetic field 

The electrons around a nucleus will circulate in a magnetic field and create a secondary induced magnetic field. This 
field opposes the applied field as stipulated by Lenz's law and atoms with higher induced fields (i.e., higher electron 
density) are therefore called shielded, relative to those with lower electron density. The chemical milieu of an atom 
can influence its electron density through the polar effect. Electron-donating alkyl groups, for example, lead to 
increased shielding while electron-withdrawing substituents such as nitro groups lead to deshielding of the nucleus. 
Not only substituents cause local induced fields. Bonding electrons can also lead to shielding and deshielding effects. 
A striking example of this are the pi bonds in benzene. Circular current through the hyperconjugated system causes a 
shielding effect at the molecule's center and a deshielding effect at its edges. Trends in chemical shift are explained 
based on the degree of shielding or deshielding. 

Nuclei are found to resonate in a wide range to the left (or more rare to the right) of the internal standard. When a 
signal is found with a higher chemical shift: 

• the applied effective magnetic field is lower, if the resonance frequency is fixed, (as in old traditional CW 
spectrometers) 

• the frequency is higher, when the applied magnetic field is static, (normal case in FT spectrometers) 

• the nucleus is more de shielded 

• the signal or shift is downfield or at low field or paramagnetic 

Conversely a lower chemical shift is called a diamagnetic shift, and is upfield and more shielded. 

Diamagnetic shielding 

In real molecules protons are surrounded by a cloud of charge due to adjacent bonds and atoms. In an applied 
magnetic field (B ) electrons circulate and produce an induced field (B ) which opposes the applied field. The 
effective field at the nucleus will be B = B Q - B.. The nucleus is said to be experiencing a diamagnetic shielding 

Factors causing chemical shifts 

Important factors influencing chemical shift are electron density, electronegativity of neighboring groups and 
anisotropic induced magnetic field effects. 

Electron density shields a nucleus from the external field. For example in proton NMR the electron-poor tropylium 
ion has its protons downfield at 9.17 ppm, those of the electron-rich cyclooctatetraenyl anion move upfield to 6.75 
ppm and its dianion even more upfield to 5.56 ppm. 

A nucleus in the vicinity of an electronegative atom experiences reduced electron density and the nucleus is therefore 
deshielded. In proton NMR of methyl halides (CH^X) the chemical shift of the methyl protons increase in the order I 
< Br < CI < F from 2.16 ppm to 4.26 ppm reflecting this trend. In carbon NMR the chemical shift of the carbon 
nuclei increase in the same order from around -10 ppm to 70 ppm. Also when the electronegative atom is removed 
further away the effect diminishes until it can be observed no longer. 

Anisotropic induced magnetic field effects are the result of a local induced magnetic field experienced by a nucleus 
resulting from circulating electrons that can either be paramagnetic when it is parallel to the applied field or 
diamagnetic when it is opposed to it. It is observed in alkenes where the double bond is oriented perpendicular to the 
external field with pi electrons likewise circulating at right angles. The induced magnetic field lines are parallel to 
the external field at the location of the alkene protons which therefore shift downfield to a 4.5 ppm to 7.5 ppm range. 
The three-dimensional space where a nucleus experiences diamagnetic shift is called the shielding zone with a 
cone-like shape aligned with the external field. 
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B 0 <=1 




The protons in aromatic compounds are shifted downfield even further with a signal for benzene at 7.73 ppm as a 
consequence of a diamagnetic ring current. 

Alkyne protons by contrast resonate at high field in a 2-3 ppm range. For alkynes the most effective orientation is 
the external field in parallel with electrons circulation around the triple bond. In this way the acetylenic protons are 
located in the cone-shaped shielding zone hence the upfield shift. 




Magnetic properties of most common nuclei 

1 13 

H and C aren't the only nuclei susceptible to NMR experiments. A number of different nuclei can also be 
detected, although the use of such techniques is generally rare due to small relative sensitivities in NMR experiments 
(compared to 1 H) of the nuclei in question, the other factor for rare use being their slender representation in 
nature/organic compounds. 



Isotope 


Occurrence 
in nature 

(%) 


spin 
number 1 


Magnetic 
moment u 


Electric quadrupole 
moment 

(<?xl0" 24 cm 2 ) 


Operating frequency 
at7T 
(MHz) 


Relative 
sensitivity 


'H 


99.984 


1/2 


2.79628 




300.13 


1 


2 H 


0.016 


1 


0.85739 


2.8 x 10" 3 


46.07 


0.0964 


io B 


18.8 


3 


1.8005 


7.4 x 10" 2 


32.25 


0.0199 


U B 


81.2 


3/2 


2.6880 


2.6 x 10" 2 


96.29 


0.165 


12 C 


98.9 


0 










13 c 


1.1 


1/2 


0.70220 




75.47 


0.0159 


14 N 


99.64 


1 


0.40358 


7.1 x 10" 2 


21.68 


0.00101 


15 N 


0.37 


1/2 


-0.28304 




30.41 


0.00104 


16 Q 


99.76 


0 










17 o 


0.0317 


5/2 


-1.8930 


-4.0 x 10" 3 


40.69 


0.0291 


19 F 


100 


1/2 


2.6273 




282.40 


0.834 


28 Si 


92.28 


0 










29 Si 


4.70 


1/2 


-0.55548 




59.63 


0.0785 



Chemical Shift 



6 



31 p 


100 


1/2 


1.1205 




121.49 


0.0664 


35 Q 


75.4 


3/2 


0.92091 


-7.9 x 10~ 2 


29.41 


0.0047 


37 C1 


24.6 


3/2 


0.68330 


-6.2 x 10~ 2 


24.48 


0.0027 


Magnetic properties of 

i - [5] 
common nuclei 















1 13 15 19 31 

H, C, N, F and P are the five nuclei that have the greatest importance in NMR experiments: 

• *H because of high sensitivity and vast occurrence in organic compounds 
13 

• C because of being the key component of all organic compounds despite occurring at a low abundance (1.1%) 

12 

compared to the major isotope of carbon C, which has a spin of 0 and therefore is NMR inactive. 

• 15 N because of being a key component of important biomolecules such as proteins and DNA 

• 19 F because of high relative sensitivity 
31 

• P because of frequent occurrence in organic compounds and moderate relative sensitivity 



Other chemical shifts 

The related Knight shift (first reported in 1949) is observed with pure metals. The NMR chemical shift in its present 
day meaning first appeared in journals in 1950. Chemical shifts with a different meaning appear in X-ray 
photoelectron spectroscopy as the shift in atomic core-level energy due to a specific chemical environment. The term 
is also used in Mossbauer spectroscopy, where similarly to NMR it refers to a shift in peak position due to the local 
chemical bonding environment. As is the case for NMR the chemical shift reflects the electron density at the atomic 
nucleus. ^ 



See also 

• Carbon- 13 NMR 

• MRI 

• NMR spectroscopy 

• 2D-FT NMRI and Spectroscopy 

• Nuclear magnetic resonance 

• Protein NMR 

• Proton NMR 

• Solid-state NMR 

• Zeeman effect 



External links 

www.chem.wisc.edu ^ 
BioMagResBank [8] 

T91 

wwwchem.csustan.edu 

T71 

Proton chemical shifts 
Carbon chemical shifts ^ 10 ^ 

1 13 

Online tutorials (these generally involve combined use of IR, H NMR, C NMR and mass spectrometry) 

rm ri2i 

• Problem set 1, advanced (see also this link for more background information on spin-spin coupling) 

ri3i 

• Problem set 2, moderate 

• Problem set 4, moderate, German language (don't let that scare you away!) 

• Problem set 5, the best! '- 15 -' 
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Combined solutions to problem set 5 (Problems 1-32) 



[16] 



and (Problems 33-64) 



[17] 
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The Knight shift is a shift in the nuclear magnetic resonance frequency of a paramagnetic substance first published 
in 1949 by the American physicist Walter David Knight. 

The Knight shift is due to the conduction electrons in metals. They introduce an "extra" effective field at the nuclear 
site, due to the spin orientations of the conduction electrons in the presence of an external field. This is responsible 
for the shift observed in the nuclear magnetic resonance. The shift comes from two sources, one is the Pauli 
paramagnetic spin susceptibility, the other is the s-component wavefunctions at the nucleus. 

Depending on the electronic structure, Knight shift may be temperature dependent. However, in metals which 
normally have a broad featureless electronic density of states, Knight shifts are temperature independent. 



Knight shift 
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Robinson oscillator 



The Robinson oscillator (or Robinson marginal oscillator) is an electronic circuit used in the field of Nuclear 
Magnetic Resonance (NMR). The oscillator forms the underlying basis of Magnetic Resonance Imaging (MRI) 
systems used in many hospitals. It was invented by the British physicist Neville Robinson. 

References 

• Deschamps, P., Vaissiere, J. and Sullivan, N. S., Integrated circuit Robinson oscillator for NMR detection ^\ 
Review of Scientific Instruments, 48(6):664-668, June 1977. DOI 10.1063/1.1135103 

Hi 

• Wilson, K. J. and Vallabhan, C. P. G., An improved MOSFET-based Robinson oscillator for NMR detection , 
Meas. Set TechnoL, l(5):458-460, May 1990. DOI 10.1088/0957-0233/1/5/015 
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Fourier transform 



In mathematics, the Fourier transform (often abbreviated FT) is an operation that transforms one complex-valued 
function of a real variable into another. In such applications as signal processing, the domain of the original function 
is typically time and is accordingly called the time domain. The domain of the new function is typically called the 
frequency domain, and the new function itself is called the frequency domain representation of the original function. 
It describes which frequencies are present in the original function. This is analogous to describing a musical chord in 
terms of the notes being played. In effect, the Fourier transform decomposes a function into oscillatory functions. 
The term Fourier transform refers both to the frequency domain representation of a function, and to the process or 
formula that "transforms" one function into the other. 

The Fourier transform and its generalizations are the subject of Fourier analysis. In this specific case, both the time 
and frequency domains are unbounded linear continua. It is possible to define the Fourier transform of a function of 
several variables, which is important for instance in the physical study of wave motion and optics. It is also possible 
to generalize the Fourier transform on discrete structures such as finite groups, efficient computation of which 
through a fast Fourier transform is essential for high-speed computing. 

Fourier transforms 

Continuous Fourier transform 

Fourier series 

Discrete Fourier transform 

Discrete-time Fourier 
transform 

Related transforms 
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Definition 

There are several common conventions for defining the Fourier transform of an integrable function / : R — > C 
(Kaiser 1994). This article will use the definition: 



roc 

/(£) = / f( x ) e _27rzx ^ dx ) for every real number g. 
J —oo 



When the independent variable x represents time (with SI unit of seconds), the transform variable g represents 
frequency (in hertz). Under suitable conditions, /can be reconstructed from f by the inverse transform: 

/(#) = / f(£) e * d£i for every real number x. 

— oo 

For other common conventions and notations, including using the angular frequency co instead of the frequency g, 
see Other conventions and Other notations below. The Fourier transform on Euclidean space is treated separately, in 
which the variable x often represents position and g momentum. 

Introduction 

The motivation for the Fourier transform comes from the study of Fourier series. In the study of Fourier series, 
complicated periodic functions are written as the sum of simple waves mathematically represented by sines and 
cosines. Due to the properties of sine and cosine it is possible to recover the amount of each wave in the sum by an 
integral. In many cases it is desirable to use Euler's formula, which states that e 2m0 = cos 2jt0 + i sin 2ji0, to write 

i0 

Fourier series in terms of the basic waves e m . This has the advantage of simplifying many of the formulas involved 
and providing a formulation for Fourier series that more closely resembles the definition followed in this article. This 
passage from sines and cosines to complex exponentials makes it necessary for the Fourier coefficients to be 
complex valued. The usual interpretation of this complex number is that it gives you both the amplitude (or size) of 
the wave present in the function and the phase (or the initial angle) of the wave. This passage also introduces the 
need for negative "frequencies". If 6 were measured in seconds then the waves e 1 and e~ m would both complete 
one cycle per second, but they represent different frequencies in the Fourier transform. Hence, frequency no longer 
measures the number of cycles per unit time, but is closely related. 

We may use Fourier series to motivate the Fourier transform as follows. Suppose that / is a function which is zero 
outside of some interval [-L/2, L/2] . Then for any T > L we may expand / in a Fourier series on the interval 
[-772,772], where the "amount" (denoted by c ) of the wave e in the Fourier series of /is given by 

T/2 



f{n/T) =c n = f e- 2 ™ x ' T f{x) dx 

J-T/2 



-T/2 

and/ should be given by the formula 

-j oo 

Ti— — OO 

If we let = n/T, and we let Ag = (n + \)IT - n/T= l/T, then this last sum becomes the Riemann sum 

oo 

/(*)= E m n )e 2 ^A^. 

71= — OO 

By letting T —> °° this Riemann sum converges to the integral for the inverse Fourier transform given in the 
Definition section. Under suitable conditions this argument may be made precise (Stein & Shakarchi 2003). Hence, 
as in the case of Fourier series, the Fourier transform can be thought of as a function that measures how much of 
each individual frequency is present in our function, and we can recombine these waves by using an integral (or 
"continuous sum") to reproduce the original function. 

The following images provide a visual illustration of how the Fourier transform measures whether a frequency is 
present in a particular function. The function depicted /(t) = cos(67rt)e _7rt oscillates at 3 hertz (if t measures 



Fourier transform 



10 



seconds) and tends quickly to 0. This function was specially chosen to have a real Fourier transform which can easily 

be plotted. The first image contains its graph. In order to calculate /(3) we must integrate e~ 2m fit). The second image 

shows the plot of the real and imaginary parts of this function. The real part of the integrand is almost always 
positive, this is because when fit) is negative, then the real part of e ~ 2m ^ i s negative as well. Because they oscillate 
at the same rate, when fit) is positive, so is the real part of e ~ lm ^ % T ne result is that when you integrate the real part 
of the integrand you get a relatively large number (in this case 0.5). On the other hand, when you try to measure a 
frequency that is not present, as in the case when we look at /(5) > me integrand oscillates enough so that the integral is 

very small. The general situation may be a bit more complicated than this, but this in spirit is how the Fourier 
transform measures how much of an individual frequency is present in a function fit). 



c -II ! 
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The Fourier transform of /(f ) 
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Fourier transform with 3 and 5 
hertz labeled. 



Properties of the Fourier transform 

An integr able function is a function /on the real line that is Lebesgue-measurable and satisfies 

/oo 
\f{x)\ dx < oo. 
-oo 

Basic properties 

Given integrable functions /(x), g(x), and h(x) denote their Fourier transforms by /(£)» anc * ^(0 

respectively. The Fourier transform has the following basic properties (Pinsky 2002). 
Linearity 

For any complex numbers a and b, if h(x) = afix) + bg(x), then h(£) = a • f(£) + b ■ • 
Translation 

For any real number x , if h(x) =fix - x Q ), then h(£) = e _2?rzxo ^/(^) . 
Modulation 

For any real number g , if h(x) = e 2mx ^0fix), then h(£) = /(£ — £o) ■ 
Scaling 

1 ~/A 

For a non-zero real number a, if h(x) =fiax), then = — :/ I — J . The case a = -1 leads to the 

\a\ \a J 

time-reversal property, which states: if h(x) =fi-x), then h(£) = /(— £) • 
Conjugation 



If =/(*), then &(£) = /(_£). 

In particular, if/is real, then one has the reality condition fi^_Q = f(£). 
And if /is purely imaginary, then — —fi{£\ 
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Convolution 



If h(x) = (f*g) (x) , then h(0 = /(£) ■ g(0- 
Uniform continuity and the Riemann— Lebesgue lemma 
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The rectangular function is Lebesgue integrable. 




-6 -5 -4 -3 -2 -1 0 12 3 4 5 6 

The sine function, the Fourier transform of the 
rectangular function, is bounded and continuous, 
but not Lebesgue integrable. 



/(£) ^ o as |e| -> oo. 

The Fourier transform f of an integrable function / is bounded and continuous, but need not be integrable - for 

example, the Fourier transform of the rectangular function, which is a step function (and hence integrable) is the sine 
function, which is not Lebesgue integrable, though it does have an improper integral: one has an analog to the 
alternating harmonic series, which is a convergent sum but not absolutely convergent. 

It is not possible in general to write the inverse transform as a Lebesgue integral. However, when both /and f are 
integrable, the following inverse equality holds true for almost every x: 



/•oo A 
-oo 



Almost everywhere, / is equal to the continuous function given by the right-hand side. If / is given as continuous 
function on the line, then equality holds for every x. 

A consequence of the preceding result is that the Fourier transform is injective on L (R). 

The Plancherel theorem and Parseval's theorem 

Let f{x) and g(x) be integrable, and let J(£)and g(£)be their Fourier transforms. If f(x) and g(x) are also 
square-integrable, then we have Parseval's theorem (Rudin 1987, p. 187): 

/'OO A 
mm rfe, 
-oo 

where the bar denotes complex conjugation. 

The Plancherel theorem, which is equivalent to Parseval's theorem, states (Rudin 1987, p. 186): 



'OO 



•oo 



— oo 



■CO 



\f{x)\'dx= m df- 



— OO 



The Plancherel theorem makes it possible to define the Fourier transform for functions in L (R), as described in 
Generalizations below. The Plancherel theorem has the interpretation in the sciences that the Fourier transform 
preserves the energy of the original quantity. It should be noted that depending on the author either of these theorems 
might be referred to as the Plancherel theorem or as Parseval's theorem. 

See Pontryagin duality for a general formulation of this concept in the context of locally compact abelian groups. 
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Poisson summation formula 

The Poisson summation formula provides a link between the study of Fourier transforms and Fourier Series. Given 
an integrable function /we can consider the periodization of /given by: 



f{x) = /(* + k ) 



5 

kez 

where the summation is taken over the set of all integers k. The Poisson summation formula relates the Fourier series 
of f to the Fourier transform off Specifically it states that the Fourier series of f is given by: 

2irikx 



f(x)~J2f(k)e- 



Convolution theorem 

The Fourier transform translates between convolution and multiplication of functions. If fx) and g(x) are integrable 
functions with Fourier transforms J(£)and respectively, then the Fourier transform of the convolution is 

given by the product of the Fourier transforms f(£\and (under other conventions for the definition of the 

Fourier transform a constant factor may appear). 
This means that if: 

/oo 
f(y)g(x-y)dy, 
-OO 

where * denotes the convolution operation, then: 

In linear time invariant (LTI) system theory, it is common to interpret g(x) as the impulse response of an LTI system 
with input fix) and output h(x), since substituting the unit impulse for fix) yields h(x) = g(x). In this case, g(£) 

represents the frequency response of the system. 

Conversely, if fix) can be decomposed as the product of two square integrable functions p(x) and q(x), then the 
Fourier transform of/(x) is given by the convolution of the respective Fourier transforms p(£)and q(£) • 

Cross-correlation theorem 

In an analogous manner, it can be shown that if h(x) is the cross-correlation of fix) and g(x): 

/oo 
f(y)g(x + y)dy 
-oo 

then the Fourier transform of h(x) is: 



As a special case, the autocorrelation of function fix) is: 



•oo 



&(*) = (/*/)(*)= / f(y)f(x + y)dy 

J —oo 

for which 



Ho=f{Of(o = \m\ 2 . 
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Eigenfunctions 

One important choice of an orthonormal basis for L (R) is given by the Hermite functions 

ol/4 

ynl 

where H n (x) are the "probabilist's" Hermite polynomials, defined by H (x) = (-1) exp(x 12) D n exp(-x 12). Under 
this convention for the Fourier transform, we have that 

In other words, the Hermite functions form a complete orthonormal system of eigenfunctions for the Fourier 
transform on L (R) (Pinsky 2002). However, this choice of eigenfunctions is not unique. There are only four 
different eigenvalues of the Fourier transform (±1 and ±i) and any linear combination of eigenfunctions with the 

2 

same eigenvalue gives another eigenfunction. As a consequence of this, it is possible to decompose L (R) as a direct 

k 

sum of four spaces H , H , H ' , and H where the Fourier transform acts on H simply by multiplication by i . This 
approach to define the Fourier transform is due to N. Wiener (Duo andikoetxe a 2001). The choice of Hermite 
functions is convenient because they are exponentially localized in both frequency and time domains, and thus give 
rise to the fractional Fourier transform used in time-frequency analysis (Boashash 2003). 



Fourier transform on Euclidean space 

The Fourier transform can be in any arbitrary number of dimensions n. As with the one-dimensional case there are 
many conventions, for an integrable function fix) this article takes the definition: 




where x and g are ^-dimensional vectors, and x • g is the dot product of the vectors. The dot product is sometimes 
written as (x, £) • 

All of the basic properties listed above hold for the /^-dimensional Fourier transform, as do Plancherel's and 
Parseval's theorem. When the function is integrable, the Fourier transform is still uniformly continuous and the 
Riemann-Lebesgue lemma holds. (Stein & Weiss 1971) 

Uncertainty principle 

Generally speaking, the more concentrated fix) is, the more spread out its Fourier transform f(£) must be. In 

particular, the scaling property of the Fourier transform may be seen as saying: if we "squeeze" a function in x, its 
Fourier transform "stretches out" in g. It is not possible to arbitrarily concentrate both a function and its Fourier 
transform. 

The trade-off between the compaction of a function and its Fourier transform can be formalized in the form of an 
Uncertainty Principle, and is formalized by viewing a function and its Fourier transform as conjugate variables 
with respect to the symplectic form on the time— frequency domain: from the point of view of the linear canonical 
transformation, the Fourier transform is rotation by 90° in the time-frequency domain, and preserves the symplectic 
form. 

Suppose fix) is an integrable and square-integrable function. Without loss of generality, assume that fix) is 
normalized: 



00 

\f(x)\ 2 dx = 1. 

— OO 

It follows from the Plancherel theorem that is also normalized. 

The spread around x = 0 may be measured by the dispersion about zero (Pinsky 2002) defined by 



Fourier transform 



14 



/oo 
x 2 \f(x)\ 2 dx. 
-oo 



In probability terms, this is the second moment of \f(x) | 2 about zero. 

The Uncertainty principle states that, if fix) is absolutely continuous and the functions xfix) and f(x) are square 
integrable, then 

D 0 (f)D 0 (f) > — ^ (Pinsky 2002). 



2/2 ^ 2 12 

The equality is attained only in the case f[x) — C\ e~ nx ' a (hence = gC\ e~ na * ) where a> 0 

2 

is arbitrary and C is such that/ is L -normalized (Pinsky 2002). In other words, where /is a (normalized) Gaussian 

function, centered at zero. 

In fact, this inequality implies that: 




oo \ / roc A \ 

2 i 



(x-x 0 y\f(x)\ 2 dx)[ (e-&ri/(0rde > 



— OO / W — oo 



2j„ /v \2| *,v\l2 

- 

IGtt 2 

for any Xo, £o in R (Stein & Shakarchi 2003). 

In quantum mechanics, the momentum and position wave functions are Fourier transform pairs, to within a factor of 
Planck's constant. With this constant properly taken into account, the inequality above becomes the statement of the 
Heisenberg uncertainty principle (Stein & Shakarchi 2003). 

Spherical harmonics 



Let the set of homogeneous harmonic polynomials of degree k on R" be denoted by A . The set A consists of the 

K K 

solid spherical harmonics of degree k. The solid spherical harmonics play a similar role in higher dimensions to the 
Hermite polynomials in dimension one. Specifically, if fix) = e 2P(x) for some P(x) in A , then 
/(£) = i~ k /(£) • Let the set H^be the closure in L (R n ) of linear combinations of functions of the form/(lxl)P(x) 

where P(x) is in A . The space L 2 (R n ) is then a direct sum of the spaces H and the Fourier transform maps each 

K K 

space H to itself and is possible to characterize the action of the Fourier transform on each space H (Stein & Weiss 
1971). Let/O) =f Q (\x\)P(x) (with P{x) in Ap, then /(£) = F 0 (|f |)P(£) where 

/'OO 

F 0 (r) = 2 m - k r-^ k -W J o f 0 (s)J {n+2k - 2)/2 (27rrs)s^ k V 2 ds. 
Here J, „, n „ n denotes the Bessel function of the first kind with order in + 2k- 2)12. When & = 0 this gives a 

(n + 2k - 2)12 

useful formula for the Fourier transform of a radial function (Grafakos 2004). 

Restriction problems 

In higher dimensions it becomes interesting to study restriction problems for the Fourier transform. The Fourier 
transform of an integrable function is continuous and the restriction of this function to any set is defined. But for a 
square-integrable function the Fourier transform could be a general class of square integrable functions. As such, the 
restriction of the Fourier transform of an L (R n ) function cannot be defined on sets of measure 0. It is still an active 
area of study to understand restriction problems in L p for 1 < p < 2. Surprisingly, it is possible in some cases to 
define the restriction of a Fourier transform to a set 5, provided S has non-zero curvature. The case when S is the unit 
sphere in R is of particular interest. In this case the Tomas- Stein restriction theorem states that the restriction of the 
Fourier transform to the unit sphere in R" is a bounded operator on L p provided 1 < p < (2n + 2) / (n + 3). 

One notable difference between the Fourier transform in 1 dimension versus higher dimensions concerns the partial 
sum operator. Consider an increasing collection of measurable sets E D indexed by R G (0,°°): such as balls of radius 
R centered at the origin, or cubes of side 2R. For a given integrable function/ consider the function/ defined by: 



f R (x)= f f(0e 2 ™*<%, ^r. 

JEr 
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Suppose in addition that /is in L p (R n ). For n = 1 and 1 < p < if one takes E = (-R, R), then/ converges to /in 

R R 

L P as R tends to infinity, by the boundedness of the Hilbert transform. Naively one may hope the same holds true for 
n> 1. In the case that E is taken to be a cube with side length R, then convergence still holds. Another natural 



candidate is the Euclidean ball E = : 1^1 < R}. In order for this partial sum operator to converge, it is necessary 
that the multiplier for the unit ball be bounded in L p (R n ). For n > 2 it is a celebrated theorem of Charles Fefferman 
that the multiplier for the unit ball is never bounded unless p = 2 (Duoandikoetxea 2001). In fact, when p * 2, this 
shows that not only may / fail to converge to /in L p , but for some functions /€ L p (R n ),f R is not even an element of 
L p . 

Generalizations 

Fourier transform on other function spaces 

It is possible to extend the definition of the Fourier transform to other spaces of functions. Since compactly 

2 

supported smooth functions are integrable and dense in L (R), the Plancherel theorem allows us to extend the 

2 2 

definition of the Fourier transform to general functions in L (R) by continuity arguments. Further JF : L (R) — » 
L (R) is a unitary operator (Stein & Weiss 1971, Thm. 2.3). Many of the properties remain the same for the Fourier 
transform. The Hausdorff- Young inequality can be used to extend the definition of the Fourier transform to include 
functions in L P (R) for 1 < p < 2. Unfortunately, further extensions become more technical. The Fourier transform of 
functions in L p for the range 2 < p < oo requires the study of distributions (Katznelson 1976). In fact, it can be shown 
that there are functions in L p with p>2 so that the Fourier transform is not defined as a function (Stein & Weiss 



This transform continues to enjoy many of the properties of the Fourier transform of integrable functions. One 
notable difference is that the Riemann-Lebesgue lemma fails for measures (Katznelson 1976). In the case that 
dpi =f(x) dx, then the formula above reduces to the usual definition for the Fourier transform off. In the case that pi is 
the probability distribution associated to a random variable X, the Fourier-Stieltjes transform is closely related to the 
characteristic function, but the typical conventions in probability theory take e b instead of e b (Pinsky 2002). In 
the case when the distribution has a probability density function this definition reduces to the Fourier transform 
applied to the probability density function, again with a different choice of constants. 

The Fourier transform may be used to give a characterization of continuous measures. Bochner's theorem 
characterizes which functions may arise as the Fourier-Stieltjes transform of a measure (Katznelson 1976). 

Furthermore, the Dirac delta function is not a function but it is a finite Borel measure. Its Fourier transform is a 
constant function (whose specific value depends upon the form of the Fourier transform used). 



1971). 



Fourier-Stieltjes transform 

The Fourier transform of a finite Borel measure pi on R" is given by (Pinsky 2002): 
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Tempered distributions 

The Fourier transform maps the space of Schwartz functions to itself, and gives a homeomorphism of the space to 
itself (Stein & Weiss 1971). Because of this it is possible to define the Fourier transform of tempered distributions. 
These include all the integrable functions mentioned above and have the added advantage that the Fourier transform 
of any tempered distribution is again a tempered distribution. 

The following two facts provide some motivation for the definition of the Fourier transform of a distribution. First let 
/and g be integrable functions, and let f and gbe their Fourier transforms respectively. Then the Fourier transform 

obeys the following multiplication formula (Stein & Weiss 1971), 

/ f( x )9( x )dx= / f(x)g(x)dx. 
Secondly, every integrable function /defines a distribution 7^.by the relation 

Tf((f) = / f{x)(f{x) dx for all Schwartz functions cp. 

JR n 

In fact, given a distribution T, we define the Fourier transform by the relation 

T((f) = T((f) for all Schwartz functions cp. 
It follows that 

Distributions can be differentiated and the above mentioned compatibility of the Fourier transform with 
differentiation and convolution remains true for tempered distributions. 

Locally compact abelian groups 

The Fourier transform may be generalized to any locally compact abelian group. A locally compact abelian group is 
an abelian group which is at the same time a locally compact Hausdorff topological space so that the group 
operations are continuous. If G is a locally compact abelian group, it has a translation invariant measure \i, called 
Haar measure. For a locally compact abelian group G it is possible to place a topology on the set of characters Q so 

that Q is also a locally compact abelian group. For a function / in L l {G) it is possible to define the Fourier 
transform by (Katznelson 1976): 

/(0= / Z(x)f(x)dfj, for any £ G G. 
Jg 

Locally compact Hausdorff space 

The Fourier transform may be generalized to any locally compact Hausdorff space, which recovers the topology but 
loses the group structure. 

Given a locally compact Hausdorff topological space X, the space A=C Q (X) of continuous complex-valued functions 
on X which vanish at infinity is in a natural way a commutative C* -algebra, via pointwise addition, multiplication, 
complex conjugation, and with norm as the uniform norm. Conversely, the characters of this algebra A, denoted 
$^4, are naturally a topological space, and can be identified with evaluation at a point of x, and one has an isometric 

isomorphism Cq(X) — > Cq($^) . In the case where X=R is the real line, this is exactly the Fourier transform. 
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Non-abelian groups 

The Fourier transform can also be defined for functions on a non-abelian group, provided that the group is compact. 
Unlike the Fourier transform on an abelian group, which is scalar-valued, the Fourier transform on a non-abelian 
group is operator- valued (Hewitt & Ross 1971, Chapter 8). The Fourier transform on compact groups is a major tool 
in representation theory (Knapp 2001) and non-commutative harmonic analysis. 

Let G be a compact Hausdorff topological group. Let 2 denote the collection of all isomorphism classes of 
finite-dimensional irreducible unitary representations, along with a definite choice of representation on the 
Hilbert space H of finite dimension d for each o G 2. If \i is a finite Borel measure on G, then the Fourier-Stieltjes 
transform of \i is the operator on H defined by 

<K,v)h.= I (U { ;k,ri}dv(g) 

•J G 

where jj^is the complex-conjugate representation of acting on H . As in the abelian case, if \i is absolutely 
continuous with respect to the left-invariant probability measure X on G, then it is represented as 

dji = fdX 

for some/G L^X). In this case, one identifies the Fourier transform of/with the Fourier-Stieltjes transform of \i. 

The mapping fi \—¥ fi defines an isomorphism between the Banach space M(G) of finite Borel measures (see rca 
space) and a closed subspace of the Banach space C (2) consisting of all sequences E = (E ) indexed by 2 of 



a 



(bounded) linear operators E^:H^^ for which the norm 



\\E\\ = sup \\E a 

is finite. The "convolution theorem" asserts that, furthermore, this isomorphism of Banach spaces is in fact an 
isomorphism of C algebras into a subspace of C (2), in which M(G) is equipped with the product given by 
convolution of measures and C (2) the product given by multiplication of operators in each index o. 

The Peter-Weyl theorem holds, and a version of the Fourier inversion formula (Plancherel's theorem) follows: if 
/€ L 2 (G), then 



2 

where the summation is understood as convergent in the L sense. 

The generalization of the Fourier transform to the noncommutative situation has also in part contributed to the 
development of noncommutative geometry. In this context, a categorical generalization of the Fourier transform to 
noncommutative groups is Tannaka-Krein duality, which replaces the group of characters with the category of 
representations. However, this loses the connection with harmonic functions. 



Alternatives 

In signal processing terms, a function (of time) is a representation of a signal with perfect time resolution, but no 
frequency information, while the Fourier transform has perfect frequency resolution, but no time information: the 
magnitude of the Fourier transform at a point is how much frequency content there is, but location is only given by 
phase (argument of the Fourier transform at a point), and standing waves are not localized in time - a sine wave 
continues out to infinity, without decaying. This limits the usefulness of the Fourier transform for analyzing signals 
that are localized in time, notably transients, or any signal of finite extent. 

As alternatives to the Fourier transform, in time-frequency analysis, one uses time-frequency transforms or 
time-frequency distributions to represent signals in a form that has some time information and some frequency 
information - by the uncertainty principle, there is a trade-off between these. These can be generalizations of the 
Fourier transform, such as the short-time Fourier transform or fractional Fourier transform, or can use different 
functions to represent signals, as in wavelet transforms and chirplet transforms, with the wavelet analog of the 
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(continuous) Fourier transform being the continuous wavelet transform. (Boashash 2003). 



Applications 

Analysis of differential equations 

Fourier transforms and the closely related Laplace transforms are widely used in solving differential equations. The 
Fourier transform is compatible with differentiation in the following sense: if fix) is a differentiable function with 
Fourier transform then the Fourier transform of its derivative is given by 27rz£/(£)- This can be used to 

transform differential equations into algebraic equations. Note that this technique only applies to problems whose 
domain is the whole set of real numbers. By extending the Fourier transform to functions of several variables partial 
differential equations with domain R n can also be translated into algebraic equations. 

FT-NMR, FT-IR, FT-NIR and MRI 

The Fourier transform is also used in nuclear magnetic resonance (NMR) and in other kinds of spectroscopy, e.g. 
infrared (FT-IR). In NMR an exponentially- shaped free induction decay (FID) signal is acquired in the time domain 
and Fourier-transformed to a Lorentzian line-shape in the frequency domain. The Fourier transform is also used in 
magnetic resonance imaging (MRI) and mass spectrometry. 



Domain and range of the Fourier transform 

It is often desirable to have the most general domain for the Fourier transform as possible. The definition of Fourier 
transform as an integral naturally restricts the domain to the space of integrable functions. Unfortunately, there is no 
simple characterizations of which functions are Fourier transforms of integrable functions (Stein & Weiss 1971). It is 
possible to extend the domain of the Fourier transform in various ways, as discussed in generalizations above. The 
following list details some of the more common domains and ranges on which the Fourier transform is defined. 

• The space of Schwartz functions is closed under the Fourier transform. Schwartz functions are rapidly decaying 
functions and do not include all functions which are relevant for the Fourier transform. More details may be found 
in (Stein & Weiss 1971). 

• The space L p maps into the space L q , where lip + l/q = 1 and 1 < p < 2 (Hausdorff- Young inequality). 

2 

• In particular, the space L is closed under the Fourier transform, but here the Fourier transform is no longer 
defined by integration. 

• The space L 1 of Lebesgue integrable functions maps into C , the space of continuous functions that tend to zero at 
infinity - not just into the space of bounded functions (the Riemann-Lebesgue lemma). 

• The set of tempered distributions is closed under the Fourier transform. Tempered distributions are also a form of 
generalization of functions. It is in this generality that one can define the Fourier transform of objects like the 
Dirac comb. 



Other notations 

Other common notations for /(£) are: /(£), F(£), F (f) (£), (Tf) F(f), F(j")> 

J~{f} and T (/(£)) .Though less commonly other notations are used. Denoting the Fourier transform by a 

capital letter corresponding to the letter of function being transformed (such as fix) and is especially common 
in the sciences and engineering. In electronics, the omega (co) is often used instead of g due to its interpretation as 
angular frequency, sometimes it is written as F(jco), where j is the imaginary unit, to indicate its relationship with the 
Laplace transform, and sometimes it is written informally as F(2jif) in order to use ordinary frequency. 
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The interpretation of the complex function /(£)may be aided by expressing it in polar coordinate form: 
/(£) = A(^)e llp ^ m terms of the two real functions A(g) and where: 

40 = 1/(01. 

is the amplitude and 

^(0 = axg (/(£)), 
is the phase (see arg function). 

Then the inverse transform can be written: 

/oo 
-oo 

which is a recombination of all the frequency components of f(x). Each component is a complex sinusoid of the 
form e b whose amplitude is A(g) and whose initial phase angle (at x = 0) is cp(£). 

The Fourier transform may be thought of as a mapping on function spaces. This mapping is here denoted JF and 
^ 7 (/)is used to denote the Fourier transform of the function/. This mapping is linear, which means that can 

also be seen as a linear transformation on the function space and implies that the standard notation in linear algebra 
of applying a linear transformation to a vector (here the function/) can be used to write ^/instead of J~(f)- 

Since the result of applying the Fourier transform is again a function, we can be interested in the value of this 
function evaluated at the value g for its variable, and this is denoted either as J-(f) (£) or as (•?""/) (0 • Notice that 
in the former case, it is implicitly understood that is applied first to / and then the resulting function is evaluated 
at g, not the other way around. 

In mathematics and various applied sciences it is often necessary to distinguish between a function /and the value of 
/when its variable equals x, denoted f(x). This means that a notation like ^(/(x)) formally can be interpreted as 

the Fourier transform of the values off at x. Despite this flaw, the previous notation appears frequently, often when a 
particular function or a function of a particular variable is to be transformed. For example, jF(rect(x)) = sinc(£) 

is sometimes used to express that the Fourier transform of a rectangular function is a sine function, or 
F(f(x + x 0 )) = JF(/(x))e 2 ^ xo is used to express the shift property of the Fourier transform. Notice, that the 

last example is only correct under the assumption that the transformed function is a function of x, not of x . 

Other conventions 

There are three common conventions for defining the Fourier transform. The Fourier transform is often written in 
terms of angular frequency: co = 2jzE, whose units are radians per second. 

The substitution g = col (2k) into the formulas above produces this convention: 



/H = f /(*> 



\e~ l "- x dx. 

Under this convention, the inverse transform becomes: 



Unlike the convention followed in this article, when the Fourier transform is defined this way, it is no longer a 
unitary transformation on L (R n ). There is also less symmetry between the formulas for the Fourier transform and its 
inverse. 

Another popular convention is to split the factor of (2jt) evenly between the Fourier transform and its inverse, which 
leads to definitions: 



J y > (2tt)"/ 2 y R „ M 1 
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f(x) = - \ / f(u)e i0J - x duo. 

2 n 

Under this convention, the Fourier transform is again a unitary transformation on L (R ). It also restores the 
symmetry between the Fourier transform and its inverse. 

Variations of all three conventions can be created by conjugating the complex-exponential kernel of both the forward 
and the reverse transform. The signs must be opposites. Other than that, the choice is (again) a matter of convention. 



Summary of popular forms of the Fourier transform 



ordinary frequency £ (hertz) 


unitary 


Att) = / /(*)« '-'' h " C - dx = / a (2xfl = (27r)"/ 2 /3(2*-e) 

/(*)= / Ui)^ ix< di 


angular frequency co (rad/s) 


non -unitary 


Ah d = f / f{x)e*"dx = h Vf] = (2t)" /2 Ah 

J R n \2ir/ 


unitary 


= I /, / Moo)**"" 



The ordinary-frequency convention (which is used in this article) is the one most often found in the mathematics 
literature. In the physics literature, the two angular-frequency conventions are more commonly used. 

As discussed above, the characteristic function of a random variable is the same as the Fourier-Stieltjes transform of 
its distribution measure, but in this context it is typical to take a different convention for the constants. Typically 

characteristic function is defined E(e lt ' X ) = / e lt ' x dfix{ x ) • ^ s m me case °f me "non-unitary angular 

frequency" convention above, there is no factor of 2jt appearing in either of the integral, or in the exponential. 
Unlike any of the conventions appearing above, this convention takes the opposite sign in the exponential. 

Tables of important Fourier transforms 

The following tables record some closed form Fourier transforms. For functions f(x) , g(x) and h(x) denote their 
Fourier transforms by f , g , and ^ respectively. Only the three most common conventions are included. It is 

sometimes useful to notice that entry 105 gives a relationship between the Fourier transform of a function and the 
original function, which can be seen as relating the Fourier transform and its inverse. 

Functional relationships 

The Fourier transforms in this table may be found in (Erdelyi 1954) or the appendix of (Kammler 2000). 
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Function 


Fourier transform 

unitafv nt*rliTi5ii*v fi*pnnpripv 

111111(11 > , Ul 11111(11 y 11 CUUCllt V 


Fourier transform 

nnitarv 3110111^1* frpmipnpv 

Ulll laL j y rtllgUlrtl IICUUCIILV 


Fourier transform 

nnn -iinit»i*v ancmlai* 
1 1 1 j 1 1 u 1 1 1 1 ti 1 v ^ ctiiguicti 

frequency 


Remarks 




/(*) 


fOO 

J — oo 


/» = 4= r ^ x)e ~™ dx 


roc 

/>)= / f(x)e~^dx 

J — OO 


Definition 


101 


a ■ f(x) + b ■ g(x) 




a ■ J(lu) + b-g(v) 


a- f{y) + b-g(v) 


Linearity 


102 


f(x - a) 








Shift in 

time 

domain 


103 


e 2 * iax f(x) 


/(£-") 


J{uj — 2-ko) 


f{v - 2na) 


Shift in 
frequency 
domain, 
dual of 
102 


104 


f(ax) 


|a/(a) 


,V(-) 

\a\ \aj 


iV(-) 

\a\ \a/ 


Scaling in 
the time 
domain. If 
\a is 
large, then 
f(ax) is 
concentrated 
around 0 
and 

rV (-) 

spreads out 
and 

flattens. 


105 


/» 




/(-") 




Duality. 
Here f 

needs to be 
calculated 
using the 
same 

method as 

Fourier 

transform 

column. 

Results 

from 

swapping 

"dummy" 

variables 

of x and 

£ or CJ or 

I/. 


106 


d n f(x) 
dx 11 


(2wie)"/(0 








107 


x n f(x) 


/ i yd- ho 

\27tJ d£ n 


du) n 




This is the 
dual of 
106 
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108 










The 

notation 

f*9 
denotes the 
convolution 
of /and 
9 — this 
rule is the 

pnnvnlntinn 

theorem 


109 


f{x)g{x) 








This is the 
dual of 
108 


110 


For f(x) a purely 
real 


H-o = m) 


/(-") = /(w) 


/(-") = /» 


Hermitian 
symmetry. 

Z 

indicates 
the 

complex 
conjugate. 


111 


For f(x) a purely 
real even function 


f{w) , /(£) an d /(^) are purely real even functions. 




112 


For f[x) a purely 
real odd function 


/S. A, A 

f(uj) , /(0 anc * /(^) are purely imaginary odd functions. 





Square-integrable functions 

The Fourier transforms in this table may be found in (Campbell & Foster 1948), (Erdelyi 1954), or the appendix of 
(Kammler 2000). 





Function 


Fourier transform 
unitary, ordinary frequency 


Fourier transform 
unitary, angular frequency 


Fourier transform 
non -unitary, angular 
frequency 


Remarks 




/(*) 


/■oo 

f(() = / f(x)e- 2 ™t dx 

J — OO 




too 

f(y)= / f(x)e-^dx 

J — OO 




201 


rect(ax) 


— • sine I — 
\a\ \a ) 


1 


1 . ( w \ 

, • sine n 

y/2na 2 \2iraJ 




1 . ( v \ 

. • sine 
a\ \2tclJ 


The rectangular 
pulse and the 
normalized sine 
function, here 
defined as 
sinc(x) = 
sin(jrx)/(jrx) 


202 


sinc(ax) 


— • rect ( — 
\a\ \aj 


I 


1 / (J \ 
V2na 2 \2na) 




1 ' rect (n ) 
a\ \2iraJ 


Dual of rule 
201. The 
rectangular 
function is an 
ideal low-pass 
filter, and the 
sine function is 
the non-causal 
impulse 

response of such 
a filter. 
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203 



sine 2 (ax) 




/2na 2 



tri 



a VZira/ 



The function 
tri(x) is the 
triangular 
function 



204 



tn(ax) 



1 



a 



sine — 




sine 



V27raJ 



1 



a 



■ 2f » ' 

sine - — 
V2xa. 



Dual of rule 
203. 



205 



1 



\/27r(a + i'jj) 



a + iv 



The function 
m(jc) is the 
Heaviside unit 
step function 
and a>0. 



206 



-ax 2 




— • e n 
a 



1 _^ 

e 4 « 



V 7 ^ 



'7T 

a 



This shows that, 
for the unitary 
Fourier 

transforms, the 
Gaussian 
function 
exp(-ax ) is its 
own Fourier 
transform for 
some choice of 
a. For this to be 
integrable we 
must have 
Re(a)>0. 
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-a\x\ 



2a 



a 2 + 4tt 2 £ 2 



a 



7r ar + to 



2a 



a 2 + v A 



For a>0. That 
is, the Fourier 
transform of a 
decaying 
exponential 
function is a 
Lorentzian 
function. 
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J n (x) 



x 



n 

• vT~ - 47r 2 £ 2 rcct(7r£) 



2 7 



7r n 



\/l — c<J 2 rect ^— ^ 



n 
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The functions / 
(x) are the rc-th 
order Bessel 
functions of the 
first kind. The 
functions U (x) 

n 

are the 
Chebyshev 
polynomial of 
the second kind. 
See 315 and 316 
below. 
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Hyperbolic 
secant is its own 
Fourier 
transform 
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Distributions 

The Fourier transforms in this table may be found in (Erdelyi 1954) or the appendix of (Kammler 2000). 





Function 


Fourier transform 
unitary, ordinary frequency 


Fourier transform 
unitary, angular frequency 


Fourier transform 
non-unitary, angular frequency 






Remarks 


f(x) 


/OO 

/(e) = / f(x)e~ 2 ^ dx 

J — OO 


/•oo 

/(c) = / /(xje - ** dx 

V Z7T -/— OC 


/(„)= / f{z)e-"dx 

J — OO 






m 


V^TT • S(lj) 


27Td(z/) 


The distribution 6(g) denotes the Dirac delta fi 


x) 


i 


1 
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Dual of rule 301. 
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V ( 27r ■ 6(lj — a) 


27rS(v — a) 


This follows from 103 and 301. 
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V2tt- 
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tt - a) + + a)) 


This follows from rules 101 and 303 using Eu 
formula: cos(ax) = (e iax + e~ iax )/2 
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2 V 2tt • 
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This follows from 101 and 303 using 

sin(a:r) = (e iax - e~ iax )/(2i). 






Z 2 












>s(ax 2 ) 


/7T / 7T 2 £ 2 7r\ 

Va C ° S i a 4 J 


1 /LJ 2 7r\ 

v^a C ° U V4a 4 J 


Va°°"U 4 J 




i(ax 2 ) 




-1 . (U 2 7r\ 
, Sill 

v% V 4tt 4 / 


Pk . (v 2 7r\ 
V a Sm V4a 4 / 




i 




V27T/ W 
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Here, n is a natural number and $( n ) (£)is th< 

distribution derivative of the Dirac delta funct 
This rule follows from rules 107 and 301. Cor 
this rule with 101, we can transform all polync 






-wr sgn($) 


V 2 


— 27T sgn(V) 


Here sgn(£) is the sign function. Note that llx 
distribution. It is necessary to use the Cauchy 
principal value when testing against Schwartz 
functions. This rule is useful in studying the H 
transform. 
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llx 1 is the homogeneous distribution defined t 
regularizing the singularity via 

1 r» *(*) - EE* x k <i> (k) 

rfTL / ryTl 

A j J — OO X 
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o sin(7ra/2)r(a + 1) 


-2 sin(7ra/2)r(a+ 1) 
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If Re a > -1, then \x \ is a locally integrable 






|27re| a+1 
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function, and so a tempered distribution. The i 
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x\ n is a holomorphic function from ti 












right half- plane to the space of tempered 












distributions. It admits a unique meromorphic 












extension to a tempered distribution, also dene 
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a for a * -2, -4, ... (See homogeneous 
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The dual of rule 309. This time the Fourier 
transforms need to be considered as Cauchy pi 
value. 
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The function u(x) is the Heaviside unit step fu 
this follows from rules 101, 301, and 312. 
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This function is known as the Dirac comb fun< 
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The function ^Ot) is the zeroth order Bessel fi 
of first kind. 
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This is a generalization of 315. The function J 
the n-th order Bessel function of first kind. Th 
function T (x) is the Chebyshev polynomial oi 

n 

first kind. 



Two-dimensional functions 



Function 


Fourier transform 
unitary, ordinary frequency 


Fourier transform 
unitary, angular frequency 


Fourier transform 
non-unitary, angular frequency 


Ren 




/(£-,&) = // f(x,y)e- 2 ^ x+ ^dxdy 


/(w.,c*) = ^ ff f(x,y)e- i ^-^dxdy 


f(u„u,)= J J f&yy-*"***"'* dxdy 


The 

vari; 

£,£ 

CO , 1 

y 

and 

real 

num 

The 

intej 
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over 

entii 

plan 


e -Tr(a 2 x 2 +b 2 y 2 ) 


1 p -7r(^/a 2 +^/b 2 ) 
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Formulas for general n -dimensional functions 





Function 


Fourier transform 
unitary, ordinary frequency 


Fourier transform 
unitary, angular frequency 


Fourier transform 
non -unitary, angular 
frequency 


Remarks 




/(*) 


/(f) = / f(x)e- 2 ™<d n x 


/(«)=,„ 1/2, / me—dTx 


/(„)= / f(x)e-^"<Tx 
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7r-^+l)|<f|- ( ^- 5 
•J„ /2+5 (27r|e|) 


2- 5 r(^ + i)|^p (n/2M 

■Jn/2+6(\v\) 


-{n/2)-S 
■Jn/2+s(W\) 


The 

function 

^[0,i] isthe 
indicator 

function of 

the interval 

[0,1]. The 

function 

r(x) is the 

gamma 

function. 

The 

function 

J n/2 + d* 

Bessel 
function of 
the first 
kind with 
order 
n/2+5. 
Taking 
n - 2 and 
6=0 
produces 
402. (Stein 
& Weiss 
1971, Thm. 
4.13) 
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|x| _a , 0 < Rea < n. 








See Riesz 

potential. 

The 

formula 

also holds 

for all a * 

-n, -72-1, 

...by 

analytic 

continuation, 

but then the 

function 

and its 

Fourier 

transform 

need to be 

understood 

as suitably 

regularized 

tempered 

distributions. 

See 

homogeneous 
distribution. 
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See also 

• Fourier series 

• Fast Fourier transform 

• Laplace transform 

• Discrete Fourier transform 

• DFT matrix 

• Discrete -time Fourier transform 

• Fourier— Deligne transform 

• Fractional Fourier transform 

• Linear canonical transform 

• Fourier sine transform 

• Short-time Fourier transform 

• Fourier inversion theorem 

• Analog signal processing 

• Transform (mathematics) 

• Integral transform 

• Hartley transform 

• Hankel transform 
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External links 

• Fourier Series Applet (Tip: drag magnitude or phase dots up or down to change the wave form). 

• Stephan Bernsee's FFTlab ^ (Java Applet) 

• Stanford Video Course on the Fourier Transform ^ 

• Tables of Integral Transforms ^ at EqWorld: The World of Mathematical Equations. 

• Weisstein, Eric W., "Fourier Transform from Math World. 

• Fourier Transform Module by John H. Mathews 

• The DFT "a Pied": Mastering The Fourier Transform in One Day ^ at The DSP Dimension 

• An Interactive Flash Tutorial for the Fourier Transform 
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Discrete Fourier transform 



Fourier transforms 

Continuous Fourier transform 

Fourier series 

Discrete Fourier transform 

Discrete-time Fourier 
transform 

Related transforms 

In mathematics, the discrete Fourier transform (DFT) is a specific kind of Fourier transform, used in Fourier 
analysis. It transforms one function into another, which is called the frequency domain representation, or simply the 
DFT, of the original function (which is often a function in the time domain). But the DFT requires an input function 
that is discrete and whose non-zero values have a limited (finite) duration. Such inputs are often created by sampling 
a continuous function, like a person's voice. Unlike the discrete-time Fourier transform (DTFT), it only evaluates 
enough frequency components to reconstruct the finite segment that was analyzed. Using the DFT implies that the 
finite segment that is analyzed is one period of an infinitely extended periodic signal; if this is not actually true, a 
window function has to be used to reduce the artifacts in the spectrum. For the same reason, the inverse DFT cannot 
reproduce the entire time domain, unless the input happens to be periodic (forever). Therefore it is often said that the 
DFT is a transform for Fourier analysis of finite-domain discrete-time functions. The sinusoidal basis functions of 
the decomposition have the same properties. 

The input to the DFT is a finite sequence of real or complex numbers (with more abstract generalizations discussed 
below), making the DFT ideal for processing information stored in computers. In particular, the DFT is widely 
employed in signal processing and related fields to analyze the frequencies contained in a sampled signal, to solve 
partial differential equations, and to perform other operations such as convolutions or multiplying large integers. A 
key enabling factor for these applications is the fact that the DFT can be computed efficiently in practice using a fast 
Fourier transform (FFT) algorithm. 

FFT algorithms are so commonly employed to compute DFTs that the term "FFT" is often used to mean "DFT" in 
colloquial settings. Formally, there is a clear distinction: "DFT" refers to a mathematical transformation or function, 
regardless of how it is computed, whereas "FFT" refers to a specific family of algorithms for computing DFTs. The 
terminology is further blurred by the (now rare) synonym finite Fourier transform for the DFT, which apparently 
predates the term "fast Fourier transform" (Cooley et al., 1969) but has the same initialism. 

Definition 

The sequence of N complex numbers x , x^_ 1 is transformed into the sequence of N complex numbers X , 

X by the DFT according to the formula: 

AM 

N-l 

X k = J2 X n^ kn * = <>,..., JV-1 

71=0 

where i is the imaginary unit and e ^pis a primitive N'th root of unity. (This expression can also be written in terms 

of a DFT matrix; when scaled appropriately it becomes a unitary matrix and the X^ can thus be viewed as coefficients 
of x in an orthonormal basis.) 

The transform is sometimes denoted by the symbol J? , as in X = T {x} or J 7 (x) or J^x.- 
The inverse discrete Fourier transform (IDFT) is given by 
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i N-l 

x n = ^J2 X ^ kn n = 0,...,N-l 

^ k=0 



A simple description of these equations is that the complex numbers represent the amplitude and phase of the 
different sinusoidal components of the input "signal" x n . The DFT computes the X k from the x n , while the 
IDFT shows how to compute the X n 2& a sum of sinusoidal components (^\/N)X] c e^ rkn wrtri frequency k/N 

cycles per sample. By writing the equations in this form, we are making extensive use of Euler's formula to express 
sinusoids in terms of complex exponentials, which are much easier to manipulate. In the same way, by writing X^ 
in polar form, we obtain the sinusoid amplitude ^/-/Vand phase 0^.from the complex modulus and argument of 

Xk , respectively: 

A k = \X k \ = y/Re(X k )* + Im(X k )*, 

ip k = arg(X fc ) = atan2(lm(X fc ),Re(X fc )), 
where atan2 is the two-argument form of the arctan function. Note that the normalization factor multiplying the DFT 
and IDFT (here 1 and 1/AO and the signs of the exponents are merely conventions, and differ in some treatments. The 
only requirements of these conventions are that the DFT and IDFT have opposite- sign exponents and that the 



product of their normalization factors be l/N. A normalization of \j y J\[ for both the DFT and IDFT makes the 

transforms unitary, which has some theoretical advantages, but it is often more practical in numerical computation to 
perform the scaling all at once as above (and a unit scaling can be convenient in other ways). 

(The convention of a negative sign in the exponent is often convenient because it means that X k is me amplitude of 
a "positive frequency" 2nk / N . Equivalently, the DFT is often thought of as a matched filter: when looking for a 

frequency of +1, one correlates the incoming signal with a frequency of -1.) 

In the following discussion the terms "sequence" and "vector" will be considered interchangeable. 

Properties 

Completeness 

The discrete Fourier transform is an invertible, linear transformation 
T : C N -> 

with C denoting the set of complex numbers. In other words, for any N>0, an A/-dimensional complex vector has a 
DFT and an IDFT which are in turn Af-dimensional complex vectors. 

Orthogonality 

The vectors e ^r kn form an orthogonal basis over the set of Af-dimensional complex vectors: 
N-l 



£ (e^) (e- 2 ^) = N 8 kk , 



n=0 

where 5^/ is the Kronecker delta. This orthogonality condition can be used to derive the formula for the IDFT from 
the definition of the DFT, and is equivalent to the unitarity property below. 
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The Plancherel theorem and Parseval's theorem 

If X 1 and Y 1 are the DFTs of x and y respectively then the Plancherel theorem states: 

k k n J n r J 

N-1 i N-1 

E ^y: = E w 

n=0 iV fc=0 

where the star denotes complex conjugation. Parseval's theorem is a special case of the Plancherel theorem and 
states: 



N-1 1 N-1 



EW = ^EI* 2 

n=0 iV fc=0 



These theorems are also equivalent to the unitary condition below. 

Periodicity 

If the expression that defines the DFT is evaluated for all integers k instead of just for k = 0, . . . , N — 1, then 
the resulting infinite sequence is a periodic extension of the DFT, periodic with period N. 
The periodicity can be shown directly from the definition: 

N-1 N-1 N-1 

x k+N d ^ e ^- ¥(fe+A °" = E ^-^C3 = E ^ e " ¥fc " = 

n=0 n=0 1 n=0 

Similarly, it can be shown that the IDFT formula leads to a periodic extension. 

The shift theorem 

Multiplying x n by a linear phase e ^ L nm for some integer m corresponds to a circular shift of the output Xj^ : 

Xjc is replaced by X^- m , where the subscript is interpreted modulo N (i.e., periodically). Similarly, a circular 

shift of the input x n corresponds to multiplying the output Xj^by a linear phase. Mathematically, if {x n } 

represents the vector x then 
if = X k 

then f({ Xn . e^ m }) k = X k _ m 
and ^({x n _ m }) fc = X k • e~^ k ™ 

Circular convolution theorem and cross-correlation theorem 

The convolution theorem for the continuous and discrete time Fourier transforms indicates that a convolution of two 
infinite sequences can be obtained as the inverse transform of the product of the individual transforms. With 
sequences and transforms of length N, a circularity arises: 



fc=0 



- N-1 /N-1 \ /N-1 



N 

k=0 \ 1=0 / \m=0 

iV-1 / 1 N-1 \ 

1=0 m=0 \ k=0 J 

The quantity in parentheses is 0 for all values of m except those of the form n — I — pN , where p is any integer. 

At those values, it is 1 . It can therefore be replaced by an infinite sum of Kronecker delta functions, and we continue 
accordingly. Note that we can also extend the limits of m to infinity, with the understanding that the x and y 
sequences are defined as 0 outside [0,N-1]: 
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N—l oo / oo 

3m,(n-l-pN) 

1=0 m=— oo \p=— oo 

TV— 1 oo / oo 

— ^/ I 2/m * ^(n-I-piV) 



Z=0 p= — oo \m=—oc 

Vn-l-pN 



V v ' 



N-l / oo 

def 



= ^2 X{ [ ^2 Vn-l-pN J = (X * y N ) n , 
1=0 \p=-oo / 

which is the convolution of the x sequence with a periodically extended y sequence defined by: 

oo 

(yN)n = 22 y( n ~p N )- 

p=—oo 

Similarly, it can be shown that: 

N-l 

{x * ■ Y}„ = £ x? • (y N ) n+l = (x * y N ) n , 

1=0 

which is the cross-correlation of x and yN • 

A direct evaluation of the convolution or correlation summation (above) requires 0(7V 2 ) operations for an output 

sequence of length N. An indirect method, using transforms, can take advantage of the 0(N log N) efficiency of 

the fast Fourier transform (FFT) to achieve much better performance. Furthermore, convolutions can be used to 
efficiently compute DFTs via Rader's FFT algorithm and Bluestein's FFT algorithm. 

Methods have also been developed to use circular convolution as part of an efficient process that achieves normal 
(non-circular) convolution with an xor y sequence potentially much longer than the practical transform size (N). 
Two such methods are called overlap-save and overlap-add^ . 

Convolution theorem duality 

It can also be shown that: 

N-l 

T {x • y} k = ^2x n -y n - 



27ri kn 



n=0 

1 

= — (X * Yn)/j, which is the circular convolution of Xand Y 



Trigonometric interpolation polynomial 

The trigonometric interpolation polynomial 



p(t) = [X 0 + X ie H + ■■■ + Xjv/a-ieW 2 - 1 ^' + X N/2 coa(Nt/2) + X N/2+1 e ( -- N / 2+ ^ it + ■■■ + X N _ ie 

N even , 

p(t) = [X 0 + X ie lt + ■■■ + X lN/2] e^ lt + X LJV /2j+ie- LJV/2Ji< + ■ • • + Iw-ie" 1 '] for N 
odd, 

where the coefficients are given by the DFT of above, satisfies the interpolation property p(27rn/N) = x n 
for n = 0, . . . , N — 1. 

For even N, notice that the Nyquist component — ^jl^ cos( Nt/2) is handled specially. 
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This interpolation is not unique: aliasing implies that one could add N to any of the complex- sinusoid frequencies 
(e.g. changing g _ ^to e i(N-l)t) without changing the interpolation property, but giving different values in 

between the x n points. The choice above, however, is typical because it has two useful properties. First, it consists 
of sinusoids whose frequencies have the smallest possible magnitudes, and therefore minimizes the mean- square 

slope I \p f (t) \ 2 dt of the interpolating function. Second, if the x n are real numbers, then p[t) is real as well. 

In contrast, the most obvious trigonometric interpolation polynomial is the one in which the frequencies range from 
0 to TV — 1 (instead of roughly —N/2to +7V/2as above), similar to the inverse DFT formula. This 

interpolation does not minimize the slope, and is not generally real-valued for real X n ; its use is a common mistake. 



The unitary DFT 

Another way of looking at the DFT is to note that in the above discussion, the DFT can be expressed as a 
Vandermonde matrix: 



F = 



UJ 



UJ 



00 

N 

1-0 

N 



UJ 



UJ 



0- 1 
N 

1- 1 

N 



UJ 



UJ 



O-(JV-l) - 
N 

l.(N-l) 
N 



(AT-l)-O (iV-l)-l 



UJ 



N 



UJ 



(N-l).(N-l) 
N 



where 



is a primitive Nth root of unity. The inverse transform is then given by the inverse of the above matrix: 

F-i = _L F * 



With unitary normalization constants 1 / yJV > the DFT becomes a unitary transformation, defined by a unitary 
matrix: 

u = f/Vn 
u _1 = u* 

det(U)| = 1 

where det( ) is the determinant function. The determinant is the product of the eigenvalues, which are always ±1 or 
±i as described below. In a real vector space, a unitary transformation can be thought of as simply a rigid rotation 
of the coordinate system, and all of the properties of a rigid rotation can be found in the unitary DFT. 
The orthogonality of the DFT is now expressed as an orthonormality condition (which arises in many areas of 
mathematics as described in root of unity): 
N-l 

m=0 

If Xis defined as the unitary DFT of the vector xthen 

N-l 

n=0 

and the Plancherel theorem is expressed as: 

7V-1 N-l 

£ x n y* n = £ X k Y; 

n=0 k=0 

If we view the DFT as just a coordinate transformation which simply specifies the components of a vector in a new 
coordinate system, then the above is just the statement that the dot product of two vectors is preserved under a 
unitary DFT transformation. For the special case X = y, this implies that the length of a vector is preserved as 
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well — this is just Parseval's theorem: 

N-l N-l 
n=0 k=0 

Expressing the inverse DFT in terms of the DFT 

A useful property of the DFT is that the inverse DFT can be easily expressed in terms of the (forward) DFT, via 
several well-known "tricks". (For example, in computations, it is often convenient to only implement a fast Fourier 
transform corresponding to one transform direction and then to get the other transform direction from the first.) 

First, we can compute the inverse DFT by reversing the inputs: 

T-\{x n }) = F{{x N . n })/N 

(As usual, the subscripts are interpreted modulo N; thus, for 77, = 0, we have %N-0 — ^0.) 
Second, one can also conjugate the inputs and outputs: 

•F -1 (x) = F(x*)*/N 

Third, a variant of this conjugation trick, which is sometimes preferable because it requires no modification of the 
data values, involves swapping real and imaginary parts (which can be done on a computer simply by modifying 
pointers). Define swap( x n ) as X n with its real and imaginary parts swapped — that is, if x n = a + bi then 
swap( X n ) is I) -\- ai • Equivalently, swap( X n ) equals ix n . Then 

.F _1 (x) = swap(^(swap(x)))/7V 
That is, the inverse transform is the same as the forward transform with the real and imaginary parts swapped for 
both input and output, up to a normalization (Duhamel et al, 1988). 

The conjugation trick can also be used to define a new transform, closely related to the DFT, that is involutary — that 
is, which is its own inverse. In particular, T(x) = ^(x*) /vN is clearly its own inverse: T(T(x)) = X- A 

closely related involutary transformation (by a factor of (1+0 Nl) is HCx) = ^((1 + 0 X *) / v2iV> since the 
(1 + i) factors in H (i?(x)) cancel the 2. For real inputs X, the real part of if(x)is none other than the 
discrete Hartley transform, which is also involutary. 

Eigenvalues and eigenvectors 

The eigenvalues of the DFT matrix are simple and well-known, whereas the eigenvectors are complicated, not 
unique, and are the subject of ongoing research. 

Consider the unitary form U defined above for the DFT of length N, where 

U - _ _L e -^(m-l)(«-l) 

m '" ~ y/N N ~ 
This matrix satisfies the equation: 

u 4 = i. 

This can be seen from the inverse properties above: operating XJtwice gives the original data in reverse order, so 
operating Ufour times gives back the original data and is thus the identity matrix. This means that the eigenvalues 
\ satisfy a characteristic equation: 

A 4 = 1. 

Therefore, the eigenvalues of XJ are me fourth roots of unity: \ is +1, -1, +/, or -/. 

Since there are only four distinct eigenvalues for this N X N matrix, they have some multiplicity. The multiplicity 
gives the number of linearly independent eigenvectors corresponding to each eigenvalue. (Note that there are TV 
independent eigenvectors; a unitary matrix is never defective.) 
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The problem of their multiplicity was solved by McClellan and Parks (1972), although it was later shown to have 
been equivalent to a problem solved by Gauss (Dickinson and Steiglitz, 1982). The multiplicity depends on the value 
of N modulo 4, and is given by the following table: 

Multiplicities of the eigenvalues X of the unitary DFT matrix U as a function of the 

transform size N (in terms of an integer m). 



size N 




A = -1 


A = -i 


A = +/ 


4m 


m + 1 


m 


m 


m - 1 


4m + 1 


m + 1 


m 


m 


m 


Am + 2 


m + 1 


m + 1 


m 


m 


4m + 3 


m + 1 


m + 1 


m + 1 


m 



No simple analytical formula for general eigenvectors is known. Moreover, the eigenvectors are not unique because 
any linear combination of eigenvectors for the same eigenvalue is also an eigenvector for that eigenvalue. Various 
researchers have proposed different choices of eigenvectors, selected to satisfy useful properties like orthogonality 
and to have "simple" forms (e.g., McClellan and Parks, 1972; Dickinson and Steiglitz, 1982; Grunbaum, 1982; 
Atakishiyev and Wolf, 1997; Candan et al, 2000; Hanna et al, 2004; Gurevich and Hadani, 2008). However two 
simple closed- form analytical eigenvectors for special DFT period N were found (Kong, 2008): 

For DFT period N = 2L + 1 = 4K +1, where K is an integer, the following is an eigenvector of DFT: 




For DFT period N=2L = 4K, where K is an integer, the following is an eigenvector of DFT: 




The choice of eigenvectors of the DFT matrix has become important in recent years in order to define a discrete 
analogue of the fractional Fourier transform — the DFT matrix can be taken to fractional powers by exponentiating 
the eigenvalues (e.g., Rubio and Santhanam, 2005). For the continuous Fourier transform, the natural orthogonal 
eigenfunctions are the Hermite functions, so various discrete analogues of these have been employed as the 
eigenvectors of the DFT, such as the Kravchuk polynomials (Atakishiyev and Wolf, 1997). The "best" choice of 
eigenvectors to define a fractional discrete Fourier transform remains an open question, however. 



The real-input DFT 

If Xq, . . . , X/v-iare real numbers, as they often are in practical applications, then the DFT obeys the symmetry: 

Xk = x* N _ k . 

The star denotes complex conjugation. The subscripts are interpreted modulo N. 

Therefore, the DFT output for real inputs is half redundant, and one obtains the complete information by only 
looking at roughly half of the outputs Xq, . . . , -Xjv - — 1- ^ n tms case > me "DC" element Xois purely real, and for 
even N the "Nyquist" element Xjy^is also real, so there are exactly N non-redundant real numbers in the first half 
+ Nyquist element of the complex output X. 

Using Euler's formula, the interpolating trigonometric polynomial can then be interpreted as a sum of sine and cosine 
functions. 
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Generalized/shifted DFT 

It is possible to shift the transform sampling in time and/or frequency domain by some real shifts a and b, 
respectively. This is sometimes known as a generalized DFT (or GDFT), also called the shifted DFT or offset 
DFT, and has analogous properties to the ordinary DFT: 

X k = £ x n e-^ k+b ^ n+a ^ k = 0, . . . , N - 1. 

n=0 

Most often, shifts of 1/2 (half a sample) are used. While the ordinary DFT corresponds to a periodic signal in both 
time and frequency domains, a = 1/2 produces a signal that is anti-periodic in frequency domain ( 
Xk+N — ~Xk ) and vice- versa for b = 1/2- Thus, the specific case of a = b = l/2is known as an odd-time 

2 

odd-frequency discrete Fourier transform (or O DFT). Such shifted transforms are most often used for symmetric 
data, to represent different boundary symmetries, and for real- symmetric data they correspond to different forms of 
the discrete cosine and sine transforms. 

Another interesting choice is a = b = — (N — l)/2, which is called the centered DFT (or CDFT). The 

centered DFT has the useful property that, when N is a multiple of four, all four of its eigenvalues (see above) have 
equal multiplicities (Rubio and Santhanam, 2005 ) L J 

The discrete Fourier transform can be viewed as a special case of the z-transform, evaluated on the unit circle in the 
complex plane; more general z-transforms correspond to complex shifts a and b above. 



Multidimensional DFT 

The ordinary DFT transforms a one-dimensional sequence or array £ n that is a function of exactly one discrete 
variable n. The multidimensional DFT of a multidimensional array ^ni,n 2 ,...,n d that is a function of d discrete 
variables ri£ = 0, 1, . . . , N% — lfor £ in 1, 2, . . . , d is defined by: 



iVi-l / JV 2 -1 / N d -1 

N d ' ^rii,n2,...,n d 

ni=0 \ rti — 0 \ 71^=0 




where u; Ne — exp(— 2ni/ Np) as above and the d output indices run from hi = 0, 1, . . . , — 1 . This is 
more compactly expressed in vector notation, where we define n = (ni, . . . , Tid) and 
k = (ki, k>2, . . . , &d)as d-dimensional vectors of indices from 0 to N — 1, which we define as 
N-l = (JVi -l,JV 2 -l 3 ... 5 iV rf -l): 

N-l 



-27rik-(n/N) 



n=0 

where the division n/Nis defined as n/N = {ni/Ni, . . . , n^/N^to be performed element- wise, and the 
sum denotes the set of nested summations above. 

The inverse of the multi-dimensional DFT is, analogous to the one-dimensional case, given by: 

N-l 



IL=i N e k =o 

As the one-dimensional DFT expresses the input x n as a superposition of sinusoids, the multidimensional DFT 
expresses the input as a superposition of plane waves, or sinusoids. The direction of oscillation in space is k/N. 

The amplitudes are X\^. This decomposition is of great importance for everything from digital image processing 
(two-dimensional) to solving partial differential equations. The solution is broken up into plane waves. 
The multidimensional DFT can be computed by the composition of a sequence of one-dimensional DFTs along each 
dimension. In the two-dimensional case ^ni,n 2 the N\ independent DFTs of the rows (i.e., along TI2) are 
computed first to form a new array 2/ni,fc 2 . Then the ^independent DFTs of y along the columns (along are 
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computed to form the final result X^^. Alternatively the columns can be computed first and then the rows. The order is 
immaterial because the nested summations above commute. 

An algorithm to compute a one-dimensional DFT is thus sufficient to efficiently compute a multidimensional DFT. 
This approach is known as the row-column algorithm. There are also intrinsically multidimensional FFT algorithms. 

The real-input multidimensional DFT 

For input data x ni,n 2 ,...,n d consisting of real numbers, the DFT outputs have a conjugate symmetry similar to the 
one-dimensional case above: 

-Xfci.fca,...,**! = ^N 1 -k 1 ,N 2 -k 2 ,...,N d -k d ^ 
where the star again denotes complex conjugation and the £ -th subscript is again interpreted modulo Ni (for 

£ — X,2,....(/). 

Applications 

The DFT has seen wide usage across a large number of fields; we only sketch a few examples below (see also the 
references at the end). All applications of the DFT depend crucially on the availability of a fast algorithm to compute 
discrete Fourier transforms and their inverses, a fast Fourier transform. 

Spectral analysis 

When the DFT is used for spectral analysis, the {x n } sequence usually represents a finite set of uniformly -spaced 

time-samples of some signal x(t), where t represents time. The conversion from continuous time to samples 

(discrete-time) changes the underlying Fourier transform of x(t) into a discrete-time Fourier transform (DTFT), 
which generally entails a type of distortion called aliasing. Choice of an appropriate sample-rate (see Nyquist 
frequency) is the key to minimizing that distortion. Similarly, the conversion from a very long (or infinite) sequence 
to a manageable size entails a type of distortion called leakage, which is manifested as a loss of detail (aka 
resolution) in the DTFT. Choice of an appropriate sub- sequence length is the primary key to minimizing that effect. 
When the available data (and time to process it) is more than the amount needed to attain the desired frequency 
resolution, a standard technique is to perform multiple DFTs, for example to create a spectrogram. If the desired 
result is a power spectrum and noise or randomness is present in the data, averaging the magnitude components of 
the multiple DFTs is a useful procedure to reduce the variance of the spectrum (also called a periodogram in this 
context); two examples of such techniques are the Welch method and the Bartlett method; the general subject of 
estimating the power spectrum of a noisy signal is called spectral estimation. 

A final source of distortion (or perhaps illusion) is the DFT itself, because it is just a discrete sampling of the DTFT, 
which is a function of a continuous frequency domain. That can be mitigated by increasing the resolution of the 
DFT. That procedure is illustrated in the discrete-time Fourier transform article. 

• The procedure is sometimes referred to as zero-padding, which is a particular implementation used in conjunction 
with the fast Fourier transform (FFT) algorithm. The inefficiency of performing multiplications and additions 
with zero-valued "samples" is more than offset by the inherent efficiency of the FFT. 

• As already noted, leakage imposes a limit on the inherent resolution of the DTFT. So there is a practical limit to 
the benefit that can be obtained from a fine-grained DFT. 
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Data compression 

The field of digital signal processing relies heavily on operations in the frequency domain (i.e. on the Fourier 
transform). For example, several lossy image and sound compression methods employ the discrete Fourier 
transform: the signal is cut into short segments, each is transformed, and then the Fourier coefficients of high 
frequencies, which are assumed to be unnoticeable, are discarded. The decompressor computes the inverse transform 
based on this reduced number of Fourier coefficients. (Compression applications often use a specialized form of the 
DFT, the discrete cosine transform or sometimes the modified discrete cosine transform.) 

Partial differential equations 

Discrete Fourier transforms are often used to solve partial differential equations, where again the DFT is used as an 
approximation for the Fourier series (which is recovered in the limit of infinite N). The advantage of this approach is 
that it expands the signal in complex exponentials e inx , which are eigenfunctions of differentiation: dldx e inx = in e inx . 
Thus, in the Fourier representation, differentiation is simple — we just multiply by / n. (Note, however, that the 
choice of n is not unique due to aliasing; for the method to be convergent, a choice similar to that in the 
trigonometric interpolation section above should be used.) A linear differential equation with constant coefficients is 
transformed into an easily solvable algebraic equation. One then uses the inverse DFT to transform the result back 
into the ordinary spatial representation. Such an approach is called a spectral method. 

Polynomial multiplication 

Suppose we wish to compute the polynomial product c(x) = a(x) • b(x). The ordinary product expression for the 
coefficients of c involves a linear (acyclic) convolution, where indices do not "wrap around." This can be rewritten 
as a cyclic convolution by taking the coefficient vectors for a(x) and b(x) with constant term first, then appending 
zeros so that the resultant coefficient vectors a and b have dimension d > deg(a(x)) + deg(Z?(x)). Then, 

c = a * b 

Where c is the vector of coefficients for c(x), and the convolution operator * is defined so 



Here the vector product is taken elementwise. Thus the coefficients of the product polynomial c(x) are just the terms 
0, deg(a(x)) + deg(Z?(x)) of the coefficient vector 



With a fast Fourier transform, the resulting algorithm takes O (N log AO arithmetic operations. Due to its simplicity 
and speed, the Cooley— Tukey FFT algorithm, which is limited to composite sizes, is often chosen for the transform 
operation. In this case, d should be chosen as the smallest integer greater than the sum of the input polynomial 
degrees that is factorizable into small prime factors (e.g. 2, 3, and 5, depending upon the FFT implementation). 



d-1 




n = 0, 1 . . . , d — 1 



m=0 

But convolution becomes multiplication under the DFT: 



F{c) = J 27 (a).F(b) 



c = F-\T(aL)T{h)). 
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Multiplication of large integers 

The fastest known algorithms for the multiplication of very large integers use the polynomial multiplication method 
outlined above. Integers can be treated as the value of a polynomial evaluated specifically at the number base, with 
the coefficients of the polynomial corresponding to the digits in that base. After polynomial multiplication, a 
relatively low -complexity carry-propagation step completes the multiplication. 



Some discrete Fourier transform pairs 



Some DFT pairs 



-i /V-1 
fc=0 


A/-1 

71=0 


Note 


i'2irne/N 


Afc-€ 


Shift theorem 




y -i2nk£/N 




Afc = A^_ fc 


Real DFT 




f A if a = e* 2 ^ 
1 l-al-^xfc/A- otherwise 


from the geometric progression formula 


(V) 




from the binomial theorem 


(± if 2n < W or 2(A r - n) < W 
1 0 otherwise 


f 1 if A; = 0 

— v f.\ otherwise 


%n is a rectangular window function of W points centered on 
%0, where Wis an odd integer, and Xj. is a sinc-like function 



Derivation as Fourier series 

The DFT can be derived as a truncation of the Fourier series of a periodic sequence of impulses. 

Generalizations 

Representation theory 

The DFT can be interpreted as the complex-valued representation theory of the finite cyclic group. In other words, a 
sequence of n complex numbers can be thought of as an element of /i-dimensional complex space C n , or 

equivalently a function from the finite cyclic group of order n to the complex numbers, Z/nZ — > C.This latter 
may be suggestively written (jZ/nZt 0 emphasize that this is a complex vector space whose coordinates are indexed 
by the /i-element set Z/nZ. 

From this point of view, one may generalize the DFT to representation theory generally, or more narrowly to the 
representation theory of finite groups. 

More narrowly still, one may generalize the DFT by either changing the target (taking values in a field other than the 
complex numbers), or the domain (a group other than a finite cyclic group), as detailed in the sequel. 
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Other fields 

Many of the properties of the DFT only depend on the fact that e -^is a primitive root of unity, sometimes 
denoted t^jvor W^(so that u;^ = 1). Such properties include the completeness, orthogonality, 

Plancherel/Parseval, periodicity, shift, convolution, and unitarity properties above, as well as many FFT algorithms. 
For this reason, the discrete Fourier transform can be defined by using roots of unity in fields other than the complex 
numbers, and such generalizations are commonly called number- theoretic transforms (NTTs) in the case of finite 
fields. For more information, see number-theoretic transform and discrete Fourier transform (general). 

Other finite groups 

The standard DFT acts on a sequence x , x^ . . ., jc of complex numbers, which can be viewed as a function {0, 1, 
...,N- 1 } — > C. The multidimensional DFT acts on multidimensional sequences, which can be viewed as functions 

{0, 1, . . . , JVi - 1} x - - - x {0, 1, . . . , N d - 1} -> C. 

This suggests the generalization to Fourier transforms on arbitrary finite groups, which act on functions G — » C 
where G is a finite group. In this framework, the standard DFT is seen as the Fourier transform on a cyclic group, 
while the multidimensional DFT is a Fourier transform on a direct sum of cyclic groups. 

Alternatives 

As with other Fourier transforms, there are various alternatives to the DFT for various applications, prominent 
among which are wavelets. The analog of the DFT is the discrete wavelet transform (DWT). From the point of view 
of time— frequency analysis, a key limitation of the Fourier transform is that it does not include location information, 
only frequency information, and thus has difficulty in representing transients. As wavelets have location as well as 
frequency, they are better able to represent location, at the expense of greater difficulty representing frequency. For 
details, see comparison of the discrete wavelet transform with the discrete Fourier transform. 

See also 

• DFT matrix 

• Fast Fourier transform 

• List of Fourier-related transforms 

• FFTW 
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Fast Fourier transform 



A fast Fourier transform (FFT) is an efficient algorithm to compute the discrete Fourier transform (DFT) and its 
inverse. There are many distinct FFT algorithms involving a wide range of mathematics, from simple 
complex-number arithmetic to group theory and number theory; this article gives an overview of the available 
techniques and some of their general properties, while the specific algorithms are described in subsidiary articles 
linked below. 

A DFT decomposes a sequence of values into components of different frequencies. This operation is useful in many 
fields (see discrete Fourier transform for properties and applications of the transform) but computing it directly from 
the definition is often too slow to be practical. An FFT is a way to compute the same result more quickly: computing 

2 

a DFT of N points in the naive way, using the definition, takes 0(N ) arithmetical operations, while an FFT can 
compute the same result in only 0(N log AO operations. The difference in speed can be substantial, especially for 
long data sets where N may be in the thousands or millions — in practice, the computation time can be reduced by 
several orders of magnitude in such cases, and the improvement is roughly proportional to N/log(N). This huge 
improvement made many DFT-based algorithms practical; FFTs are of great importance to a wide variety of 
applications, from digital signal processing and solving partial differential equations to algorithms for quick 
multiplication of large integers. 

The most well known FFT algorithms depend upon the factorization of N, but (contrary to popular misconception) 
there are FFTs with 0(N log AO complexity for all N, even for prime N. Many FFT algorithms only depend on the 
fact that e -^pis an 7V m primitive root of unity, and thus can be applied to analogous transforms over any finite 

field, such as number-theoretic transforms. 

Since the inverse DFT is the same as the DFT, but with the opposite sign in the exponent and a l/N factor, any FFT 
algorithm can easily be adapted for it. 

Definition and speed 

An FFT computes the DFT and produces exactly the same result as evaluating the DFT definition directly; the only 
difference is that an FFT is much faster. (In the presence of round- off error, many FFT algorithms are also much 
more accurate than evaluating the DFT definition directly, as discussed below.) 

Let „ • ••••• JC - _ . be complex numbers. The DFT is defined bv the formula 

0 N-l r J 

N-l 

X k =Y, x n e~ i2nk % k = 0, . . . , N - 1. 

n=0 

2 

Evaluating this definition directly requires 0(N ) operations: there are N outputs X , and each output requires a sum 
of N terms. An FFT is any method to compute the same results in 0(N log AO operations. More precisely, all known 
FFT algorithms require S(N log AO operations (technically, O only denotes an upper bound), although there is no 
proof that better complexity is impossible. 

To illustrate the savings of an FFT, consider the count of complex multiplications and additions. Evaluating the 

2 

DFT's sums directly involves N complex multiplications and N(N - 1) complex additions [of which 0(N) operations 
can be saved by eliminating trivial operations such as multiplications by 1]. The well-known radix-2 Cooley-Tukey 
algorithm, for N a power of 2, can compute the same result with only (N/2) log 2 N complex multiplies (again, 
ignoring simplifications of multiplications by 1 and similar) and Nlog^N complex additions. In practice, actual 
performance on modern computers is usually dominated by factors other than arithmetic and is a complicated subject 

2 

(see, e.g., Frigo & Johnson, 2005), but the overall improvement from S(N ) to 0(AHog AO remains. 
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Computational issues 

Bounds on complexity and operation counts 

A fundamental question of longstanding theoretical interest is to prove lower bounds on the complexity and exact 
operation counts of fast Fourier transforms, and many open problems remain. It is not even rigorously proved 
whether DFTs truly require Q(N log TV) (i.e., order TV log TV or greater) operations, even for the simple case of 

power of two sizes, although no algorithms with lower complexity are known. In particular, the count of arithmetic 
operations is usually the focus of such questions, although actual performance on modern-day computers is 
determined by many other factors such as cache or CPU pipeline optimization. 

Following pioneering work by Winograd (1978), a tight ©(TV) lower bound is known for the number of real 

multiplications required by an FFT. It can be shown that only 4TV — 2 log 2 TV — 2 log 2 TV — 4 irrational real 

multiplications are required to compute a DFT of power-of-two length TV = 2 m • Moreover, explicit algorithms 

that achieve this count are known (Heideman & Burrus, 1986; Duhamel, 1990). Unfortunately, these algorithms 

require too many additions to be practical, at least on modern computers with hardware multipliers. 

A tight lower bound is not known on the number of required additions, although lower bounds have been proved 

under some restrictive assumptions on the algorithms. In 1973, Morgenstern proved an £}(N log TV) lower bound 

on the addition count for algorithms where the multiplicative constants have bounded magnitudes (which is true for 
most but not all FFT algorithms). Pan (1986) proved an fi( TV log TV) lower bound assuming a bound on a measure 

of the FFT algorithm's "asynchronicity", but the generality of this assumption is unclear. For the case of 
power-of-two 7V> Papadimitriou (1979) argued that the number TV log 2 TV of complex- number additions achieved 

by Cooley-Tukey algorithms is optimal under certain assumptions on the graph of the algorithm (his assumptions 
imply, among other things, that no additive identities in the roots of unity are exploited). (This argument would 
imply that at least 2 TV log 2 TV real additions are required, although this is not a tight bound because extra additions 

are required as part of complex-number multiplications.) Thus far, no published FFT algorithm has achieved fewer 
than TV logo TV complex -number additions (or their equivalent) for power-of-two TV • 

A third problem is to minimize the total number of real multiplications and additions, sometimes called the 
"arithmetic complexity" (although in this context it is the exact count and not the asymptotic complexity that is being 
considered). Again, no tight lower bound has been proven. Since 1968, however, the lowest published count for 
power-of-two TV was l° n g achieved by the split-radix FFT algorithm, which requires 4 TV log 2 TV — 6 TV + 8 real 

34 

multiplications and additions for TV > 1 • This was recently reduced to <~ — TV log 2 TV (Johnson and Frigo, 
2007; Lundy and Van Buskirk, 2007). 

Most of the attempts to lower or prove the complexity of FFT algorithms have focused on the ordinary complex- data 
case, because it is the simplest. However, complex-data FFTs are so closely related to algorithms for related 
problems such as real-data FFTs, discrete cosine transforms, discrete Hartley transforms, and so on, that any 
improvement in one of these would immediately lead to improvements in the others (Duhamel & Vetterli, 1990). 



Accuracy and approximations 

All of the FFT algorithms discussed below compute the DFT exactly (in exact arithmetic, i.e. neglecting 
floating-point errors). A few "FFT" algorithms have been proposed, however, that compute the DFT approximately, 
with an error that can be made arbitrarily small at the expense of increased computations. Such algorithms trade the 
approximation error for increased speed or other properties. For example, an approximate FFT algorithm by 
Edelman et al. (1999) achieves lower communication requirements for parallel computing with the help of a fast 
multipole method. A wavelet-based approximate FFT by Guo and Burrus (1996) takes sparse inputs/outputs 
(time/frequency localization) into account more efficiently than is possible with an exact FFT. Another algorithm for 
approximate computation of a subset of the DFT outputs is due to Shentov et al. (1995). Only the Edelman algorithm 
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works equally well for sparse and non-sparse data, however, since it is based on the compressibility (rank deficiency) 
of the Fourier matrix itself rather than the compressibility (sparsity) of the data. 

Even the "exact" FFT algorithms have errors when finite-precision floating-point arithmetic is used, but these errors 
are typically quite small; most FFT algorithms, e.g. Cooley-Tukey, have excellent numerical properties. The upper 

3/2 

bound on the relative error for the Cooley-Tukey algorithm is 0(8 log AO, compared to 0(eAf ) for the naive DFT 
formula (Gentleman and Sande, 1966), where 8 is the machine floating-point relative precision. In fact, the root 
mean square (rms) errors are much better than these upper bounds, being only 0(8 Vlog AO for Cooley-Tukey and 
0(8 VAO for the naive DFT (Schatzman, 1996). These results, however, are very sensitive to the accuracy of the 
twiddle factors used in the FFT (i.e. the trigonometric function values), and it is not unusual for incautious FFT 
implementations to have much worse accuracy, e.g. if they use inaccurate trigonometric recurrence formulas. Some 
FFTs other than Cooley-Tukey, such as the Rader-Brenner algorithm, are intrinsically less stable. 

In fixed-point arithmetic, the finite-precision errors accumulated by FFT algorithms are worse, with rms errors 
growing as 0(VAO for the Cooley-Tukey algorithm (Welch, 1969). Moreover, even achieving this accuracy requires 
careful attention to scaling in order to minimize the loss of precision, and fixed-point FFT algorithms involve 
rescaling at each intermediate stage of decompositions like Cooley-Tukey. 

To verify the correctness of an FFT implementation, rigorous guarantees can be obtained in 0(N log AO time by a 
simple procedure checking the linearity, impulse-response, and time-shift properties of the transform on random 
inputs (Ergtin, 1995). 



Algorithms 

Cooley-Tukey algorithm 

By far the most common FFT is the Cooley-Tukey algorithm. This is a divide and conquer algorithm that 
recursively breaks down a DFT of any composite size N = N N into many smaller DFTs of sizes N and N , along 
with 0(AO multiplications by complex roots of unity traditionally called twiddle factors (after Gentleman and Sande, 
1966). 

This method (and the general idea of an FFT) was popularized by a publication of J. W. Cooley and J. W. Tukey in 
1965, but it was later discovered (Heideman & Burrus, 1984) that those two authors had independently re-invented 
an algorithm known to Carl Friedrich Gauss around 1805 (and subsequently rediscovered several times in limited 
forms). 

The most well-known use of the Cooley-Tukey algorithm is to divide the transform into two pieces of size 7V/2at 

each step, and is therefore limited to power-of-two sizes, but any factorization can be used in general (as was known 
to both Gauss and Cooley/Tukey). These are called the radix-2 and mixed-radix cases, respectively (and other 
variants such as the split-radix FFT have their own names as well). Although the basic idea is recursive, most 
traditional implementations rearrange the algorithm to avoid explicit recursion. Also, because the Cooley-Tukey 
algorithm breaks the DFT into smaller DFTs, it can be combined arbitrarily with any other algorithm for the DFT, 
such as those described below. 

Other FFT algorithms 

There are other FFT algorithms distinct from Cooley-Tukey. For TV = Ni A^with coprime A^and jV*2, one can 
use the Prime-Factor (Good-Thomas) algorithm (PFA), based on the Chinese Remainder Theorem, to factorize the 
DFT similarly to Cooley-Tukey but without the twiddle factors. The Rader-Brenner algorithm (1976) is a 
Cooley— Tukey-like factorization but with purely imaginary twiddle factors, reducing multiplications at the cost of 
increased additions and reduced numerical stability; it was later superseded by the split-radix variant of 
Cooley-Tukey (which achieves the same multiplication count but with fewer additions and without sacrificing 
accuracy). Algorithms that recursively factorize the DFT into smaller operations other than DFTs include the Bruun 



Fast Fourier transform 



46 



and QFT algorithms. (The Rader-Brenner and QFT algorithms were proposed for power-of-two sizes, but it is 
possible that they could be adapted to general composite n . Bruun's algorithm applies to arbitrary even composite 
sizes.) Bruun's algorithm, in particular, is based on interpreting the FFT as a recursive factorization of the 
polynomial Z N _ here into real- coefficient polynomials of the form Z M _ \md Z 2M + az M + 1- 

Another polynomial viewpoint is exploited by the Winograd algorithm, which factorizes Z N _ \ into cyclotomic 

polynomials — these often have coefficients of 1, 0, or -1, and therefore require few (if any) multiplications, so 
Winograd can be used to obtain minimal-multiplication FFTs and is often used to find efficient algorithms for small 
factors. Indeed, Winograd showed that the DFT can be computed with only O(TV) irrational multiplications, 

leading to a proven achievable lower bound on the number of multiplications for power-of-two sizes; unfortunately, 
this comes at the cost of many more additions, a tradeoff no longer favorable on modern processors with hardware 
multipliers. In particular, Winograd also makes use of the PFA as well as an algorithm by Rader for FFTs of prime 
sizes. 

Rader' s algorithm, exploiting the existence of a generator for the multiplicative group modulo prime TV » expresses a 
DFT of prime size n as a cyclic convolution of (composite) size TV — 1 , which can then be computed by a pair of 
ordinary FFTs via the convolution theorem (although Winograd uses other convolution methods). Another 
prime- size FFT is due to L. I. Bluestein, and is sometimes called the chirp-z algorithm; it also re-expresses a DFT as 
a convolution, but this time of the same size (which can be zero-padded to a power of two and evaluated by radix-2 
Cooley-Tukey FFTs, for example), via the identity nk = — {k — ^) 2 /2 + n 2 /2 + fc 2 /2- 

FFT algorithms specialized for real and/or symmetric data 

In many applications, the input data for the DFT are purely real, in which case the outputs satisfy the symmetry 

and efficient FFT algorithms have been designed for this situation (see e.g. Sorensen, 1987). One approach consists 
of taking an ordinary algorithm (e.g. Cooley-Tukey) and removing the redundant parts of the computation, saving 
roughly a factor of two in time and memory. Alternatively, it is possible to express an even-length real-input DFT as 
a complex DFT of half the length (whose real and imaginary parts are the even/odd elements of the original real 
data), followed by O(A0 post-processing operations. 

It was once believed that real-input DFTs could be more efficiently computed by means of the discrete Hartley 
transform (DHT), but it was subsequently argued that a specialized real-input DFT algorithm (FFT) can typically be 
found that requires fewer operations than the corresponding DHT algorithm (FHT) for the same number of inputs. 
Bruun's algorithm (above) is another method that was initially proposed to take advantage of real inputs, but it has 
not proved popular. 

There are further FFT specializations for the cases of real data that have even/odd symmetry, in which case one can 
gain another factor of (roughly) two in time and memory and the DFT becomes the discrete cosine/sine transform(s) 
(DCT/DST). Instead of directly modifying an FFT algorithm for these cases, DCTs/DSTs can also be computed via 
FFTs of real data combined with O(A0 pre/post processing. 
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Multidimensional FFTs 

As defined in the multidimensional DFT article, the multidimensional DFT 
X k = £ e- 2 ™ k >/ N >x n 

ii=() 

transforms an array £ n with a ^-dimensional vector of indices n = {n\ } n2, . . . , fldjhy a set of d nested 

summations (over rij = 0 . . . Nj — lfor each j), where the division n/N, defined as 

n/N = {jli/Ni, - • • ^Tld/Nd), is performed element- wise. Equivalently, it is simply the composition of a 

sequence of d sets of one-dimensional DFTs, performed along one dimension at a time (in any order). 

This compositional viewpoint immediately provides the simplest and most common multidimensional DFT 

algorithm, known as the row-column algorithm (after the two-dimensional case, below). That is, one simply 
performs a sequence of d one-dimensional FFTs (by any of the above algorithms): first you transform along the Ti\ 
dimension, then along the Tl2 dimension, and so on (or actually, any ordering will work). This method is easily 
shown to have the usual O (TV log TV) complexity, where ./V = TV1TV2 ■ ■ ■ TV^is the total number of data points 
transformed. In particular, there are TV/ N\ transforms of size N\, etcetera, so the complexity of the sequence of 
FFTs is: 

TV TV 

—0{Ni log Ni) H h w O(N d log N d ) 

= O (TV [log Ni -\ h log N d }) = 0{N log N). 

In two dimensions, the ^kcan be viewed as an Ti\ X TI2 matrix, and this algorithm corresponds to first performing 
the FFT of all the rows and then of all the columns (or vice versa), hence the name. 

In more than two dimensions, it is often advantageous for cache locality to group the dimensions recursively. For 
example, a three-dimensional FFT might first perform two-dimensional FFTs of each planar "slice" for each fixed 
Tli, and then perform the one-dimensional FFTs along the ^ldirection. More generally, an asymptotically optimal 
cache-oblivious algorithm consists of recursively dividing the dimensions into two groups (ni, • • • , fi d /2 )and 

( n d/2+l ? ' ' " j ^d) m at are transformed recursively (rounding if d is not even) (see Frigo and Johnson, 2005). Still, 

this remains a straightforward variation of the row-column algorithm that ultimately requires only a one-dimensional 
FFT algorithm as the base case, and still has O (TV log TV) complexity. Yet another variation is to perform matrix 

transpositions in between transforming subsequent dimensions, so that the transforms operate on contiguous data; 
this is especially important for out-of-core and distributed memory situations where accessing non-contiguous data is 
extremely time-consuming. 

There are other multidimensional FFT algorithms that are distinct from the row-column algorithm, although all of 
them have O (TV log TV) complexity. Perhaps the simplest non-row-column FFT is the vector-radix FFT algorithm, 

which is a generalization of the ordinary Cooley— Tukey algorithm where one divides the transform dimensions by a 
vector r = (ri, r2, ■ • • , Td)of radices at each step. (This may also have cache benefits.) The simplest case of 

vector-radix is where all of the radices are equal (e.g. vector-radix-2 divides all of the dimensions by two), but this is 
not necessary. Vector radix with only a single non-unit radix at a time, i.e. r = (1, • • ■ , 1,7", 1, • * • ,1), is 

essentially a row-column algorithm. Other, more complicated, methods include polynomial transform algorithms due 
to Nussbaumer (1977), which view the transform in terms of convolutions and polynomial products. See Duhamel 
and Vetterli (1990) for more information and references. 
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Other generalizations 

5/2 2 2 

An 0(N log AO generalization to spherical harmonics on the sphere S with N nodes was described by 

2 2 

Mohlenkamp (1999), along with an algorithm conjectured (but not proven) to have 0(N log AO complexity; 

m 2 

Mohlenkamp also provides an implementation in the libftsh library .A spherical- harmonic algorithm with 0(N 
log AO complexity is described by Rokhlin and Tygert (2006). 

Various groups have also published "FFT" algorithms for non-equispaced data, as reviewed in Potts et al. (2001). 
Such algorithms do not strictly compute the DFT (which is only defined for equispaced data), but rather some 
approximation thereof (a non-uniform discrete Fourier transform, or NDFT, which itself is often computed only 
approximately). 

See also 

• Split-radix FFT algorithm 

• Prime-factor FFT algorithm 

• Bruun's FFT algorithm 

• Rader's FFT algorithm 

• Bluestein's FFT algorithm 

• Butterfly diagram - a diagram used to describe FFTs. 

• Odlyzko-Schonhage algorithm applies the FFT to finite Dirichlet series. 

• Overlap add/Overlap save - efficient convolution methods using FFT for long signals 

• Spectral music (involves application of FFT analysis to musical composition) 

• Spectrum analyzers - Devices that perform an FFT 

• FFTW "Fastest Fourier Transform in the West" - 'C library for the discrete Fourier transform (DFT) in one or 
more dimensions. 

• Time Series 

• Math Kernel Library 
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